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(57) Abstract: A single chain, polypeptide fusion protein, comprising: a non-cytotoxic protease, or a fragment thereof, which 
protease or protease fragment is capable of cleaving a protein of the exocytic fusion apparatus of a nociceptive sensory afferent; 
a Targeting Moiety that is capable of binding to a Binding Site on the nociceptive sensory afferent, which Binding Site is capable 
of undergoing endocytosis to be incorporated into an endosome within the nociceptive sensory afferent; a protease cleavage site 
at which site the fusion protein is cleavable by a protease, wherein the protease cleavage site is located between the non-cytotoxic 
protease or fragment thereof and the Targeting Moiety; and a translocation domain that .is capable of translocating the protease or 
protease fragment from within an endosome, across the endosomal membrane and into the cytbsol of the nociceptive sensory afferent. 
Nucleic acid sequences encoding the polypeptide fusion proteins, methods of preparing same and uses thereof are also described. 
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Fusion Proteins 

This invention relates to nbn-cytbtoXic fusion proteins, and to the therapeutic 
5 application thereof as analgesic molecules. 

Toxins may be generally divided into two groups according to the type of effect 
that they have on a target cell. In more detail, the first group of toxins kill their 
natural target cells, and are therefore known as cytotoxic toxin molecules. This 

10 group of toxins is exemplified inter alia by plant toxins such as ricin, and abrin, 
and by bacterial toxins such as diphtheria toxin, and Pseudomonas exotoxin A. 
Cytotoxic toxins have attracted much interest in the design of "magic bullets" (e.g. 
immunoconjugates, which comprise a cytotoxic toxin component and an antibody 
that binds to a specific marker on a target cell) for the treatment of cellular 

15 disorders and conditions such as cancer. Cytotoxic toxins typically kill their target 
cells by inhibiting the cellular process of protein synthesis. 

The second group of toxins, which are known as non-cytotoxic toxins, do not (as 
their name confirms) kill their natural target cells. Non-cytotoxic toxins have 
20 attracted much less commercial interest than have their cytotoxic counterparts, 
and exert their effects on a target cell by inhibiting cellular processes other than 
protein synthesis. Non-cytotoxic toxins are produced by a variety of plants, and 
by a variety of microorganisms such as Clostridium sp. and Neisseria sp. 

25 Clostridial neurotoxins are proteins that typically have a molecular mass of the 
order of 150 kDa. They are produced by various species of bacteria, especially 
of the genus Clostridium, most importantly C. tetani and several strains of C. 
botulinum, C. butyricum and C. argentinense. There are at present eight different 
classes of the clostridial neurotoxin, namely: tetanus toxin, and botulinum 

30 neurotoxin in its serotypes A, B, C1, D, E, F and G, and they all share similar 
structures and modes of action. 
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Clostridial neurotoxins represent a major group of non-cytotoxic toxin molecules, 
and are synthesised by the host bacterium as single polypeptides that are 
modified post-translationally by a proteolytic cleavage event to form two 
5 polypeptide chains joined together by a disulphide bond. The two chains are 
termed the heavy chain (H-chain), which has a molecular mass of approximately 
100 kDa, and the light chain (L-chain), which has a molecular mass of 
approximately 50 kDa. 

10 L-chains possess a protease function (zinc-dependent endopeptidase activity) 
and exhibit a high substrate specificity for vesicle and/or plasma membrane 
associated proteins involved in the exocytic process. L-chains from different 
clostridial species or serotypes may hydrolyse different but specific peptide bonds 
in one of three substrate proteins, namely synaptobrevin, syntaxin or SNAP-25. 

15 These substrates are important components of the neurosecretory machinery. 

Neisseria sp., most importantly from the species N. gonorrhoeae, produce 
functionally similar non-cytotoxic proteases. An example of such a protease is 
IgA protease (see W099/58571). 

20 

It has been well documented in the art that toxin molecules may be re-targeted to 
a cell that is not the toxin's natural target cell. When so re-targeted, the modified 
toxin is capable of binding to a desired target cell and, following subsequent 
translocation into the cytosol, is capable of exerting its effect on the target cell. 

25 Said re-targeting is achieved by replacing the natural Targeting Moiety (TM) of 
the toxin with a different TM. In this regard, the TM is selected so that it will bind 
to a desired target cell, and allow subsequent passage of the modified toxin into 
an endosome within the target cell. The modified toxin also comprises a 
translocation domain to enable entry of the non-cytotoxic protease into the cell 

30 cytosol. The translocation domain can be the natural translocation domain of the 
toxin or it can be a different translocation domain obtained from a microbial 
protein with translocation activity. 
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For example, WO94/21300 describes modified clostridial neurotoxin molecules 
that are capable of regulating Integral Membrane Protein (IMP) density present at 
the cell surface of the target cell. The modified neurotoxin molecules are thus 
5 capable of controlling cell activity (e.g. glucose uptake) of the target cell. 
W096/33273 and WO99/17806 describe modified clostridial neurotoxin 
molecules that target peripheral sensory afferents. The modified neurotoxin 
molecules are thus capable of demonstrating an analgesic effect. WO00/10598 
describes the preparation of modified clostridial neurotoxin molecules that target 

10 mucus hypersecreting cells (or neuronal cells controlling said mucus 
hypersecreting cells), which modified neurotoxins are capable of inhibiting 
hypersecretion from said cells. WO01/21213 describes modified clostridial 
neurotoxin molecules that target a wide range of different types of non-neuronal 
target cells. The modified molecules are thus capable of preventing secretion 

15 from the target cells. Additional publications in the technical field of re-targeted 
toxin molecules include: WO00/62814; WO00/04926; US5,773,586; 
W093/15766; WOOO/61192; and W099/58571. 

The above-mentioned TM replacement may be effected by conventional chemical 
20 conjugation techniques, which are well known to a skilled person. In this regard, 
reference is made to Hermanson, G.T. (1996), Bioconjugate techniques, 
Academic Press, and to Wong, S.S. (1991), Chemistry of protein conjugation and 
cross-linking, CRC Press. 

25 Chemical conjugation is, however, often imprecise. For example, following 
conjugation, a TM may become joined to the remainder of the conjugate at more 
than one attachment site. 

Chemical conjugation is also difficult to control. For example, a TM may become 
30 joined to the remainder of the modified toxin at an attachment site on the 
protease component and/ or on the translocation component. This is problematic 
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when attachment to only one of said components (preferably at a single site) is 
desired for therapeutic efficacy. 

-Thusr chemical conjugation results m -a--mixed population of modified toxin 
5 molecules, which is undesirable. 

As an alternative to chemical conjugation, TM replacement may be effected by 
recombinant preparation of a single polypeptide fusion protein (see 
WO98/07864). This technique is based on the in vivo bacterial mechanism by 
10 which native clostridial neurotoxin (i.e. holotoxin) is prepared, and results in a 
fusion protein having the following structural arrangement: 

NH 2 - (protease component] - [translocation component] - [TM] - COOH 

15 According to WO98/07864, the TM is placed towards the C-terminal end of the 
fusion protein. The fusion protein is then activated by treatment with a protease, 
which cleaves at a site between the protease component and the translocation 
component. A di-chain protein is thus produced, comprising the protease 
component as a single polypeptide chain covalently attached (via a disulphide 

20 bridge) to another single polypeptide chain containing the translocation 
component plus TM. Whilst the WO98/07864 methodology follows (in terms of 
structural arrangement of the fusion protein) the natural expression system of 
clostridial holotoxin, the present inventors have found that this system may result 
in the production of certain fusion proteins that have a substantially-reduced 

25 binding ability for the intended target cell. 

There is therefore a need for an alternative or improved system for constructing a 
non-cytotoxic fusion protein. 

30 The present invention addresses one or more of the above-mentioned problems 
by providing a single chain, polypeptide fusion protein, comprising: 
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a. a non-cytotoxic protease, or a fragment thereof, which 
protease or protease fragment is capable of cleaving a 
protein of-the exocytie fusion apparatus in a nociceptive 

5 sensory afferent; 

b. a Targeting Moiety that is capable of binding to a Binding 
Site on the nociceptive sensory afferent, which Binding Site 
is capable of undergoing endocytosis to be incorporated into 
an endosome within the nociceptive sensory afferent; 

10 c. a protease cleavage site at which site the fusion protein is 

cleavable by a protease, wherein the protease cleavage site 
is located between the non-cytotoxic protease or fragment 
thereof and the Targeting Moiety; and 
d. . a translocation domain that is capable of translocating the 

15 protease or protease fragment from within an endosome, 

across the endosomal membrane and into the cytosol of the 
nociceptive sensory afferent. 



The WO98/07864 system works well for the preparation of conjugates having a 
20 TM that requires a C-terminal domain for interaction with a Binding Site on a 
target cell. In this regard, WO98/07864 provides fusion proteins having a C- 
terminal domain that is "free" to interact with a Binding Site on a target cell. The 
present inventors have found that this structural arrangement is not suitable for 
all TMs. One such category of TM is a group of TMs that binds to nociceptive 
25 sensory afferents. In more detail, the present inventors have found that the WO 
98/07864 fusion protein system is not optimal for TMs requiring a N-terminal 
domain for interaction with a binding site on a nociceptive sensory afferent. This 
problem is particularly acute with TMs that require a specific N-terminus amino 
acid residue or a specific sequence of amino acid residues including the N- 
30 terminus amino acid residue for interaction with a binding site on a nociceptive 
sensory afferent. 
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In contrast to WO98/07864, the present invention provides a system for preparing 
non-cytotoxic conjugates, wherein the TM component of the conjugate includes 
the relevant binding domain in an intra domain or an amino acid sequence 
located -towards -the- middle (ie.- of the linear -peptide sequence) of the TM, or 
5 preferably located towards the N-terminus of the TM, or more preferably at or 
near to the N-terminus. The N-terminal domain is capable of binding to a Binding 
Site on a nociceptive sensory afferent, and the TM preferably has a requirement 
for a specific and defined sequence of amino acid residue(s) to be free at its N- 
terminus. 

10 

The non-cytotoxic protease component of the present invention is a non-cytotoxic 
protease, or a fragment thereof, which protease or protease fragment is capable 
of cleaving different but specific peptide bonds in one of three substrate proteins, 
namely synaptobrevin, syntaxin or SNAP-25, of the exocytic fusion apparatus in a 

15 nociceptive sensory afferent. These substrates-are important components of the 
neurosecretory machinery. The non-cytotoxic protease component of the present 
invention is preferably a neisserial IgA protease or a fragment thereof or a 
clostridial neurotoxin L-chain or a fragment thereof. A particularly preferred non- 
cytotoxic protease component is a botulinum neurotoxin (BoNT) L-chain or a 

20 fragment thereof. 

The translocation component of the present invention enables translocation of the 
non-cytotoxic protease (or fragment thereof) into the target cell such that 
functional expression of protease activity occurs within the cytosol of the target 

25 cell. The translocation component is preferably capable of forming ion-permeable 
pores in lipid membranes under conditions of low pH. Preferably it has been 
found to use only those portions of the protein molecule capable of pore- 
formation within the endosomal membrane. The translocation component may be 
obtained from a microbial protein source, in particular from a bacterial or viral 

30 protein source. Hence, in one embodiment, the translocation component is a 
translocating domain of an enzyme, such as a bacterial toxin or viral protein. The 
translocation component of the present invention is preferably a clostridial 



WO 2006/059093 



PCT/GB2005/004585 



neurotoxin H-chain or a fragment thereof. Most preferably it is the H N domain (or 
a functional component thereof), wherein H N means a portion or fragment of the 
H-chain of a clostridial neurotoxin approximately equivalent to the amino-terminal 
half of the H-chain, or-the domain- corresponding to that fragment in the intact H- 
5 chain. 

The TM component of the present invention is responsible for binding the 
conjugate of the present invention to a Binding Site on a target cell. Thus, the 
TM component is simply a ligand through which a conjugate of the present 
10 invention binds to a selected target cell. 

In the context of the present invention, the target cell is a nociceptive sensory 
afferent, preferably a primary nociceptive afferent (e.g. an A-fibre such as an A5- 
fibre or a C-fibre). Thus, the conjugates of the present invention are capable of 

15 inhibiting neurotransmitter or neuromodulator [e.g. glutamate, substance P, 
calcitonin-gene related peptide (CGRP), and/ or neuropeptide Y] release from 
discrete populations of nociceptive sensory afferent neurons. In use, the 
conjugates reduce or prevent the transmission of sensory afferent signals (e.g. 
neurotransmitters or neuromodulators) from peripheral to central pain fibres, and 

20 therefore have application as therapeutic molecules for the treatment of pain, in 
particular chronic pain. 

It is routine to confirm that a TM binds to a nociceptive sensory afferent. For 
example, a simple radioactive displacement experiment may be employed in 

25 which tissue or cells representative of the nociceptive sensory afferent (for 
example DRGs) are exposed to labelled (e.g. tritiated) ligand in the presence of 
an excess of unlabelled ligand. In such an experiment, the relative proportions of 
non-specific and specific binding may be assessed, thereby allowing confirmation 
that the ligand binds to the nociceptive sensory afferent target cell. Optionally, 

30 the assay may include one or more binding antagonists, and the assay may 
further comprise observing a loss of ligand binding. Examples of this type of 
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experiment can be found in Hulme, E.C. (1990), Receptor-binding studies, a brief 
outline, pp. 303-311, In Receptor biochemistry, A Practical Approach, Ed. E.C. 
Hulme, Oxford University Press. 

5 The fusion proteins of the present invention generally demonstrate a reduced 
binding affinity (in the region of up to 100-fold) for nociceptive sensory afferent 
target cells when compared with the corresponding 'free' TM. However, despite 
this observation, the fusion proteins of the present invention surprisingly 
demonstrate good efficacy. This can be attributed to two principal features. First, 

10 the non-cytotoxic protease component is catalytic - thus, the therapeutic effect of 
a few such molecules is rapidly amplified. Secondly, the receptors present on the 
nociceptive sensory afferents need only act as a gateway for entry of the 
therapeutic, and need not necessarily be stimulated to a level required in order to 
achieve a ligand-receptor mediated pharmacological response. Accordingly, the 

15 -fusion proteins of the present invention may be administered at a dosage that is 
much lower that would be employed for other types of analgesic molecules such 
as NSAIDS, morphine, and gabapentin. The latter molecules are typically 
administered at high microgram to milligram (even up to hundreds of milligram) 
quantities, whereas the fusion proteins of the present invention may be 

20 administered at much lower dosages, typically at least 10-fold lower, and more 
typically at 100-fold lower. 

The TM preferably comprises a maximum of 50 amino acid residues, more 
preferably a maximum of 40 amino acid residues, particularly preferably a 
25 maximum of 30 amino acid residues, and most preferably a maximum of 20 
amino acid residues. 

Opioids represent a preferred group of TMs of the present invention. Within this 
family of peptides is included enkephalins (met and leu), endomorphins 1 and 2, 
30 p-endorphin and dynorphin. Opioid peptides are frequently used in the clinic to 
modify the activity to nociceptors, and other cells involved in the pain response. 
• - -As exemplified by the three-step World Health Organisation Analgesic Ladder, * >•■ * - 
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opioids have entry points into the pharmacological treatment of chronic cancer 
and non-cancer pain at all three stages, underlining their importance to the 
treatment of pain. Reference to opioids embraces fragments, variants and 
derivatives - thereof , -which - retain -the-ability - to bind to- nociceptive -sensory 
5 afferents. 

The TM of the invention can also be a molecule that acts as an "agonist" at one 
or more of the receptors present on a nociceptive sensory afferent, more 
particularly on a primary nociceptive afferent. Conventionally, an agonist has 

10 been considered any molecule that can either increase or decrease activities 
within a cell, namely any molecule that simply causes an alteration of cell activity. 
For example, the conventional meaning of an agonist would include a chemical 
substance capable of combining with a receptor on a cell and initiating a reaction 
or activity, or a drug that induces an active response by activating receptors, 

15 whether the response is an increase or decrease in cellular activity. 

However, for the purposes of this invention, an agonist is more specifically 
defined as a molecule that is capable of stimulating the process of exocytic fusion 
in a target cell, which process is susceptible to inhibition by a protease (or 
20 fragment thereof) capable of cleaving a protein of the exocytic fusion apparatus in 
said target cell. 

Accordingly, the particular agonist definition of the present invention would 
exclude many molecules that would be conventionally considered as agonists. 
25 For example, nerve growth factor (NGF) is an agonist in respect of its ability to 
promote neuronal differentiation via binding to a TrkA receptor. However, NGF is 
not an agonist when assessed by the above criteria because it is not a principal 
inducer of exocytic fusion. In addition, the process that NGF stimulates (i.e. cell 
differentiation) is not susceptible to inhibition by the protease activity of a non- 
30 cytotoxic toxin molecule. 
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The agonist properties of a TM that binds to a receptor on a nociceptive afferent 
can be confirmed using the methods described in Example 10. 

In a preferred embodiment of the invention, the target for the TM is the ORU 
5 receptor. This receptor is a member of the G-protein-coupled class of receptors, 
and has a seven transmembrane domain structure. The properties of the ORLi 
receptor are discussed in detail in Mogil & Pasternak (2001), Pharmacological 
Reviews, Vol. 53, No. 3, pages 381-415. 

10 In one embodiment, the TM is a molecule that binds (preferably that specifically 
binds) to the ORLi receptor. More preferably, the TM is an "agonist" of the ORLi 
receptor. The term "agonist" in this context is defined as above. 

The agonist properties of a TM that binds to an ORLi receptor can be confirmed 
15 using the methods described in Example 10. These methods are based on 
previous experiments [see Inoue et al. 1998 [Proc. Natl. Acad. Sci., 95, 10949- 
10953]), which confirm that the natural agonist of the ORLi receptor, nociceptin, 
causes the induction of substance P release from nociceptive primary afferent 
neurons. This is supported by the fact that: 

20 

> the nociceptin-induced responses are abolished by specific NK1 
receptor (the substance P receptor) antagonists; and 

> pre-treatment of the cells with capsaicin (which depletes substance 
25 P from small diameter primary afferent neurons) attenuates the 

nociceptin-induced responses. 

Similarly, Inoue et al. confirm that an intraplantar injection of botulinum neurotoxin 
type A abolishes the nociceptin-induced responses. Since it is known that BoNT 
30 inhibits the release of substance P from primary afferent neurons (Welch et al., 
2000, Toxicon, 38, 245-258), this confirms the link between nociceptin-ORLi 
interaction and subsequent release of substance>P. 
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Thus, a TM can be said to have agonist activity at the ORLi receptor if the TM 
causes an induction in the release of substance P from a nociceptive sensory 
- afferent neuron (see-Example 1 0). 

In a particularly preferred embodiment of the invention, the TM is nociceptin - the 
natural ligand for the ORL1 receptor. Nociceptin targets the ORL1 receptor with 
high affinity. Examples of other preferred TMs include: 



Code 


Sequence 


Ref. 


SEQ ID No. 


Nociceptin 1-17 


FGGFTGARKSARKLANQ 


[1] 


37,38 


Nociceptin 1-11 


FGGFTGARKSA 


[1] 


39,40 


Nocrceptin[Y10]1-11 


FGGFTGARKYA 


[1] 


41,42 I 


Nociceptin [Y11]1 -11 


FGGFTGARKSY 


[1] 


43,44 


Nociceptin [Y14J1-17 


FGGFTGARKSARKYANQ 


[1] 


45,46 


Nociceptin 1-13 


FGGFTGARKSARK 


[2] 


47,48 


Nociceptin [R14K15] 1- 
17 (also known in this 
specification as "variant" 
nociceptin) 


FGGFTGARKSARKRKNQ 


[3,4] 


49,50 


Peptide agonist 


Peptide agonists from 
combinatorial library 
approach 


[5] 





10 

[1] Mogil & Pasternak, 2001, Pharmacol. Rev., 53, 381-415 
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[2] Maile etal., 2003, Neurosci. Lett., 350, 190-192 

[3] Rizzi ef a/., 2002, J. Pharmacol. Exp. Therap., 300, 57-63 

[4] Okada etal., 2000, Biochem. Biophys. Res. Commun., 278, 493-498 

[5]-Dooley-ef at-., 1997, J Pharmacol Exp-Ther. 283(2), 735-41. 

5 

The above-identified "variant" TM demonstrates particularly good binding affinity 
(when compared with natural nociceptin) for nociceptive sensory afferents. This is 
surprising as the amino acid modifications occur at a position away from the N- 
terminus of the TM. Moreover, the modifications are almost at the C-terminus of 

10 the TM, which in turn is attached to a large polypeptide sequence (i.e. the 
translocation domain). Generally speaking, a TM-containing fusion protein will 
demonstrate an approximate 100-fold reduction in binding ability vis-a-vis the TM 
per se. The above-mentioned "variant" TM per se demonstrates an approximate 
3- to 10-fold increase in binding ability for a nociceptive sensory afferent (e.g. via 

15 the ORL1 receptor) vis-a-vis natural nociceptin. Thus, a "variant" TM-containing 
fusion might be expected to demonstrate an approximate 10-fold reduction in 
binding ability for a nociceptive sensory afferent (e.g. via the ORL1 receptor) vis- 
a-vis 'free* nociceptin. However, the present inventors have demonstrated that 
such "variant" TM-containing fusion proteins demonstrate a binding ability that 

20 (most surprisingly) closely mirrors that of 'free' nociceptin - see Figure 14. 

In the context of the present invention, the term opioid or an agonist of the ORI_i 
receptor (such as nociceptin, or any one of the peptides listed in the table above) 
embraces molecules having at least 70%, preferably at least 80%, more 
25 preferably at least 90%, and most preferably at least 95% homology with said 
opioid or agonist. The agonist homoiogues retain the agonist properties of 
nociceptin at the ORLi receptor, which may be tested using the methods 
provided in Example 10. Similarly, an opioid homologue substantially retains the 
binding function of the opioid with which it shows high homology. 

30 
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The invention also encompasses fragments, variants, and derivatives of any one 
of the TMs described above. These fragments, variants, and derivatives 
substantially retain the properties that are ascribed to said TMs. 

5 In addition to the above-mentioned opioid and non-opioid classes of TMs, a 
variety of other polypeptides are suitable for targeting the conjugates of the 
present invention to nociceptive sensory afferents (e.g. to nociceptors). In this 
regard, particular reference is made to galanin and derivatives of galanin. 
Galanin receptors are found pre- and post-synaptically in DRGs (Liu & Hokfelt, 

10 (2002), Trends Pharm. Sci., 23(10), 468-74), and are enhanced in expression 
during neuropathic pain states. Proteinase-activated receptors (PARs) are also a 
preferred group of TMs of the present invention, most particularly PAR-2. It is 
known that agonists of PAR-2 induce/ elicit acute inflammation, in part via a 
neurogenic mechanism. PAR2 is expressed by primary spinal afferent neurons, 

15 and PAR2 agonists stimulate release of substance P (SP) and calcitonin gene- 
related peptide (CGRP) in peripheral tissues 



A particularly preferred set of TMs of the present invention includes: 



Ligand 


Reference 


Nociceptin 


Guerrini, etal., (1997) J. Med. Chem., 
40, pp. 1789-1793 


p-endorphin 


Blanc, etal., (1983) J. Biol. Chem., 
258(13), pp. 8277-8284 


Endomorphin-1 ; 


Zadina, etal., (1997). Nature, 386, pp. 


Endomorphin-2 


499-502 


Dynorphin 


Fields & Basbaum (2002) Chapter 11, 
In The Textbook of Pain, Wall & Melzack 
eds. 


Met-enkephalin 


Fields & Basbaum (2002) Chapter 1 1 , 
In The Textbook of Pain, Wall & Melzack 
eds. 
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Ligand 


Reference 


Leu-enkephalin 


Fields & Basbaum (2002) Chapter 11, 
In The Textbook of Pain, Wall & Melzack 
eds. 


Galanin 


Xu et at., (2000) Neuropeptides, 34 
(3&4), 137-147 


PAR-2 peptide 


Vergnolle ef a/., (2001) Nat. Med., 7(7), 
821-826 



The protease cleavage site of the present invention allows cleavage (preferably 
controlled cleavage) of the fusion protein at a position between the non-cytotoxic 
protease component and the TM component. It is this cleavage reaction that 
5 converts the fusion protein from a single chain polypeptide into a disulphide- 
linked, di-chain polypeptide. 



According to a preferred embodiment of the present invention, the TM binds via a 
domain or amino acid sequence that is located away from the C-terminus of the 
10 TM. For example, the relevant binding domain may include an intra domain or an 
amino acid sequence located towards the middle (i.e. of the linear peptide 
sequence) of the TM. Preferably, the relevant binding domain is located towards 
the N-terminus of the TM, more preferably at or near to the N-terminus. 

15 In one embodiment, the single chain polypeptide fusion may include more than 
one proteolytic cleavage site. However, where two or more such sites exist, they 
are different, thereby substantially preventing the occurrence of multiple cleavage 
events in the presence of a single protease. In another embodiment, it is 
preferred that the single chain polypeptide fusion has a single protease cleavage 

20 site. 

The protease cleavage sequence(s) may be introduced (and/ or any inherent 
cleavage sequence removed) at the DNA level by conventional means, such as 
by site-directed mutagenesis. Screening to confirm the presence of cleavage 
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sequences may be performed manually or with the assistance of computer 
software (e.g. the MapDraw program by DNASTAR, Inc.). 



Whilst any protease cleavage site may be employed, the following are preferred: 



10 



Enterokinase 
Factor Xa 

TEV(Tobacco Etch virus) 

Thrombin 

PreScission 



(DDDDKj) 
(IEGR1 / IDGRI) 
(ENLYFQ|G) 
(LVPRJ.GS) 
(LEVLFQJGP). 



Also embraced by the term protease cleavage site is an intein, which is a self- 
cleaving sequence. The self-splicing reaction is controllable, for example by 
varying the concentration of reducing agent present. 

15 ... ... 

In use, the protease cleavage site is cleaved and the N-terminal region 
(preferably the N-terminus) of the TM becomes exposed. The resulting 
polypeptide has a TM with an N-terminal domain or an intra domain that is 
substantially free from the remainder of the conjugate. This arrangement 

20 ensures that the N-terminal component (or intra domain) of the TM may interact 
directly with a Binding Site on a target cell. 



In a preferred embodiment, the TM and the protease cleavage site are distanced 
apart in the fusion protein by at most 10 amino acid residues, more preferably by 

25 at most 5 amino acid residues, and most preferably by zero amino acid residues. 
Thus, following cleavage of the protease cleavage site, a conjugate is provided 
with a TM that has an N-terminal domain that is substantially free from the 
remainder of the conjugate. This arrangement ensures that the N-terminal 
component of the Targeting Moiety may interact directly with a Binding Site on a 

30 target cell. 
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One advantage associated with the above-mentioned activation step is that the 
TM only becomes susceptible to N-terminal degradation once proteolytic 
cleavage of the fusion protein has occurred. In addition, the selection of a 
specific protease cleavage site permits selective activation of the polypeptide 
5 fusion into a di-chain conformation. 

Construction of the single-chain polypeptide fusion of the present invention 
places the protease cleavage site between the TM and the non-cytotoxic 
protease component. 

10 

It is preferred that, in the single-chain fusion, the TM is located between the 
protease cleavage site and the translocation component. This ensures that the 
TM is attached to the translocation domain (i.e. as occurs with native clostridial 
holotoxin), though in the case of the present invention the order of the two 

15 components is reversed vis-a-vis native holotoxin. A further advantage with this 
arrangement is that the TM is located in an exposed loop region of the fusion 
protein, which has minimal structural effects on the conformation of the fusion 
protein. In this regard, said loop is variously referred to as the linker, the 
activation loop, the inter-domain linker, or just the surface exposed loop (Schiavo 

20 et al 2000, Phys. Rev., 80, 717-766; Turton et al., 2002, Trends Biochem. Sci., 
27, 552-558). 

In one embodiment, in the single chain polypeptide, the non-cytotoxic protease 
component and the translocation component are linked together by a disulphide 

25 bond. Thus, following cleavage of the protease cleavage site, the polypeptide 
assumes a di-chain conformation, wherein the protease and translocation 
components remain linked together by the disulphide bond. To this end, it is 
preferred that the protease and translocation components are distanced apart 
from one another in the single chain fusion protein by a maximum of 100 amino 

30 acid residues, more preferably a maximum of 80 amino acid residues, particularly 
preferably by a maximum of 60 amino acid residues, and most preferably by a 
maximum of 50 amino acid residues. 
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In one embodiment, the non-cytotoxic protease component forms a disulphide 
bond with the translocation component of the fusion protein. For example, the 
--amino -acid residue of-the-protease component that forms the disulphide bond is 
5 located within the last 20, preferably within the last 10 C-terminal amino acid 
residues of the protease component. Similarly, the amino acid residue within the 
translocation component that forms the second part of the disulphide bond may 
be located within the first 20, preferably within the first 10 N-terminal amino acid 
residues of the translocation component. 

10 

Alternatively, in the single chain polypeptide, the non-cytotoxic protease 
component and the TM may be linked together by a disulphide bond. In this 
regard, the amino acid residue of the TM that forms the disulphide bond is 
preferably located away from the N-terminus of the TM, more preferably towards 
15 to C-terminus of the TM. 

In one embodiment, the non-cytotoxic protease component forms a disulphide 
bond with the TM component of the fusion protein. In this regard, the amino acid 
residue of the protease component that forms the disulphide bond is preferably 
20 located within the last 20, more preferably within the last 10 C-terminal amino 
acid residues of the protease component. Similarly, the amino acid residue 
within the TM component that forms the second part of the disulphide bond is 
preferably located within the last 20, more preferably within the last 10 C-terminal 
amino acid residues of the TM. 

25 . 

The above disulphide bond arrangements have the advantage that the protease 
and translocation components are arranged in a manner similar to that for native 
clostridial neurotoxin. By way of comparison, referring to the primary amino acid 
sequence for native clostridial neurotoxin, the respective cysteine amino acid 
30 residues are distanced apart by between 8 and 27 amino acid residues - taken 
from Popoff, MR & Marvaud, J-C, 1999, Structural & genomic features of 
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clostridial neurotoxins, Chapter 9, in The Comprehensive Sourcebook of Bacterial 
Protein Toxins. Ed. Alouf & Freer: 



Serotype 1 


Sequence- 


- 


'Native' length 
between C-C 


BoNT/A1 


CVRGIITSKTKS- 


-LDKGYNKALNDLC 


23 


BoNT/A2 


CVRGIIPFKTKS- 


-LDEGYNKALNDLC 


23 


BoNT/B 


CKSVKAPG 


IC 


8 


BoNT/C 


CHKAIDGRS — 


— LYNKTLDC 


15 


BoNT/D 


CLRLTK — 


-NSRDDSTC 


12 


BoNT/E 


CKN-IVSVK 


-GIRK— SIC 


13 


BoNT/F 


CKS-VIPRK- 


-GTKAPP-RLC 


15 


BoNT/G 


CKPVMYKNT 


— GKSE — QC 


13 


TeNT 


CKKIIPPTNIRENLYNRTASLTDLGGELC 


27 



5 1 1nformation from proteolytic strains only 



The fusion protein may comprise one or more purification tags, which are located 
N-terminal to the protease component and/ or C-terminal to the translocation 
component. 

10 

Whilst any purification tag may be employed, the following are preferred: 

His-tag (e.g. 6 * histidine), preferably as a C-terminal and/ or N-terminal tag 
MBP-tag (maltose binding protein), preferably as an N-terminal tag 
15 GST-tag (glutathione-S-transferase), preferably as an N-terminal tag 
His-MBP-tag, preferably as an N-terminal tag 
GST-MBP-tag, preferably as an N-terminal tag 
Thioredoxin-tag, preferably as an N-terminal tag 
CBD-tag (Chitin Binding Domain), preferably as an N-terminal tag. 
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According to a further embodiment of the present invention, one or more peptide 
spacer molecules may be included in the fusion protein. For example, a peptide 
spacer may be employed between a purification tag and the rest of the fusion 
•protein -molecule (e.g. -between an N-terminal purification- tag and a protease 
5 component of the present invention; and/ or between a C-terminal purification tag 
and a translocation component of the present invention). A peptide spacer may 
be also employed between the TM and translocation components of the present 
invention. 

10 A variety of different spacer molecules may be employed in any of the fusion 
proteins of the present invention. Examples of such spacer molecules include 
those illustrated in Figures 28 and 29. Particular mention here is made to GS15, 
GS20, GS25, and Hx27 - see Figures 28 and 29. 

15 The present inventors have unexpectedly found that the fusion proteins (eg. 
CPNv/A) of the present invention may demonstrate an improved binding activity 
for nociceptive sensory afferents when the size of the spacer is selected so that 
(in use) the C-terminus of the TM and the N-terminus of the translocation 
component are separated from one another by 40-105 angstroms, preferably by 

20 50-100 angstroms, and more preferably by 50-90 angstroms. In another 
embodiment, the preferred spacers have an amino acid sequence of 1 1-29 amino 
acid residues, preferably 15-27 amino acid residues, and more preferably 20-27 
amino acid residues. Suitable spacers may be routinely identified and obtained 
according to Crasto, C.J. and Feng, J.A. (2000) May, 13(5), pp. 309-312 - see 

25 also http://www.fccc/edu/research/labs/fenq/limker.html . 

In accordance with a second aspect of the present invention, there is provided a 
DNA sequence that encodes the above-mentioned single chain polypeptide. In a 
preferred aspect of the present invention, the DNA sequence is prepared as part 
30 of a DNA vector, wherein the vector comprises a promoter and terminator. 



In a preferred embodiment, the vector has a promoter selected from: 



0 ' 
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Promoter 



Induction Agent 



Typical Induction Condition 



5 17-lac operator 



-AraBAD 



Tac (hybrid) 



- - L-arabinose 



IPTG 



IPTG 



0.2 mM (0.05-2.0mM) 
0.2% (0:002-0.4%)- - 
0.2 mM (0.05-2.0mM) 



The DNA construct of the present invention is preferably designed in silico, and 
then synthesised by conventional DNA synthesis techniques. 

10 The above-mentioned DNA sequence information is optionally modified for 
codon-biasing according to the ultimate host cell (e.g. E. coif) expression system 
that is to be employed. 

The DNA backbone is preferably screened for any inherent nucleic acid 
15 sequence, which when transcribed and translated would produce an amino acid 
sequence corresponding to the protease cleave site encoded by the second 
peptide-coding sequence. This screening may be performed manually or with the 
assistance of computer software (e.g. the MapDraw program by DNASTAR, Inc.). 

20 According to a further embodiment of the present invention, there is provided a 
method of preparing a non-cytotoxic agent, comprising: 

a. contacting a single-chain polypeptide fusion protein of the 
invention with a protease capable of cleaving the protease 
cleavage site; 

25 b. cleaving the protease cleavage site, and thereby forming a 



This aspect provides a di-chain polypeptide, which generally mimics the structure 
of clostridial holotoxin. In more detail, the resulting di-chain polypeptide typically 
30 has a structure wherein: 

a. the first chain comprises the non-cytotoxic protease, or a 
'"' " ! fragment thereof, which protease of protease fragment is 



di-chain fusion protein. 
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capable of cleaving a protein of the exocytic fusion apparatus 
of a nociceptive sensory afferent; 
b. the second chain comprises the TM and the translocation 
•domain- that- is- capable- of-translocating the protease or 
5 protease fragment from within an endosome, across the 

endosomal membrane and into the cytosol of the nociceptive 
sensory afferent; and 
the first and second chains are disulphide linked together. 

10 According to a further aspect of the present invention, there is provided use of a 
single chain or di-chain polypeptide of the invention, for the manufacture of a 
medicament for treating, preventing or ameliorating pain. 

According to a related aspect, there is provided a method of treating, preventing 
15 or ameliorating pain in a subject, comprising administering to said patient a 
therapeutically effective amount of a single chain or di-chain polypeptide of the 
invention. 

The present invention addresses a wide range of pain conditions, in particular 
20 chronic pain conditions. Preferred conditions include cancerous and non- 
cancerous pain, inflammatory pain and neuropathic pain. The opioid-fusions of 
the present application are particularly suited to addressing inflammatory pain, 
though may be less suited to addressing neuropathic pain. The galanin-fusions 
are more suited to addressing neuropathic pain. 

25 

In use, the polypeptides of the present invention are typically employed in the 
form of a pharmaceutical composition in association with a pharmaceutical 
carrier, diluent and/or excipient, although the exact form of the composition may 
be tailored to the mode of administration. Administration is preferably to a 
30 mammal, more preferably to a human. 
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The polypeptides may, for example, be employed in the form of a sterile solution 
for intra-articular administration or intra-cranial administration. Spinal injection 
(e.g. epidural or intrathecal) is preferred. 

5 The dosage ranges for administration of the polypeptides of the present invention 
are those to produce the desired therapeutic effect. It will be appreciated that the 
dosage range required depends on the precise nature of the components, the 
route of administration, the nature of the formulation, the age of the patient, the 
nature, extent or severity of the patient's condition, contraindications, if any, and 

1 0 the judgement of the attending physician. 

Suitable daily dosages are in the range 0.0001-1 mg/kg, preferably 0.0001- 
0.5 mg/kg, more preferably 0.002-0.5 mg/kg, and particularly preferably 0.004- 
0.5 mg/kg. The unit dosage can vary from less that 1 microgram to 30mg, but 
15 typically will be in the region of 0.01 to 1 mg per dose, which may be 
administered daily or preferably less frequently, such as weekly or six monthly. 

A particularly preferred dosing regimen is based on 2.5 ng of fusion protein (e.g. 
CPNv/A) as the 1X dose. In this regard, preferred dosages are in the range 1X- 

20 100X (i.e. 2.5-250 ng). This dosage range is significantly lower (i.e. at least 10- 
fold, typically 100-fold lower) than would be employed with other types of 
analgesic molecules such as NSAIDS, morphine, and gabapentin. Moreover, the 
above-mentioned difference is considerably magnified when the same 
comparison is made on a molar basis - this is because the fusion proteins of the 

25 present invention have a considerably greater Mw than do conventional 'small* 
molecule therapeutics. 

Wide variations in the required dosage, however, are to be expected depending 
on the precise nature of the components, and the differing efficiencies of various 
30 routes of administration. 



i- 
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Variations in these dosage levels can be adjusted using standard empirical 
routines for optimisation, as is well understood in the art. 

Compositions- suitable for injection may be-in-the-form of solutions, suspensions 
5 or emulsions, or dry powders which are dissolved or suspended in a suitable 
vehicle prior to use. 

Fluid unit dosage forms are typically prepared utilising a pyrogen-free sterile 
vehicle. The active ingredients, depending on the vehicle and concentration 
10 used, can be either dissolved or suspended in the vehicle. 

In preparing administrable solutions, the polypeptides can be dissolved in a 
vehicle, the solution being made isotonic if necessary by addition of sodium 
chloride and sterilised by filtration through a sterile filter using aseptic techniques 
15 before -filling into suitable sterile vials or ampoules and sealing. Alternatively, if 
solution stability is adequate, the solution in its sealed containers may be 
sterilised by autoclaving. 

Advantageously additives such as buffering, solubilising, stabilising, preservative 
20 or bactericidal, suspending or emulsifying agents may be dissolved in the vehicle. 

Dry powders which are dissolved or suspended in a suitable vehicle prior to use 
may be prepared by filling pre-sterilised drug substance and other ingredients 
into a sterile container using aseptic technique in a sterile area. 

25 

Alternatively the polypeptides and other ingredients may be dissolved in an 
aqueous vehicle, the solution is sterilized by filtration and distributed into suitable 
containers using aseptic technique in a sterile area. The product is then freeze 
dried and the containers are sealed aseptically. 

30 

Parenteral suspensions, suitable for intramuscular, subcutaneous or intradermal 
injection, are prepared in substantially the same manner, except that the sterile 
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components are suspended in the sterile vehicle, instead of being dissolved and 
sterilisation cannot be accomplished by filtration. The components may be 
isolated in a sterile state or alternatively it may be sterilised after isolation, e.g. by 
gamma irradiation. 

5 

Advantageously, a suspending agent for example polyvinylpyrrolidone is included 
in the composition/s to facilitate uniform distribution of the components. 

Definitions Section 

10 

Targeting Moiety (TM) means any chemical structure associated with an agent 
that functionally interacts with a Binding Site to cause a physical association 
between the agent and the surface of a target cell. In the context of the present 
invention, the target cell is a nociceptive sensory afferent. The term TM 

15 • embraces any molecule (i.e. a naturally occurring molecule, or a 
chemically/physically modified variant thereof) that is capable of binding to a 
Binding Site on the target cell, which Binding Site is capable of internalisation 
(e.g. endosome formation) - also referred to as receptor-mediated endocytosis. 
The TM may possess an endosomal membrane translocation function, in which 

20 case separate TM and Translocation Domain components need not be present in 
an agent of the present invention. 



The TM of the present invention binds (preferably specifically binds) to a 
nociceptive sensory afferent (e.g. a primary nociceptive afferent). In this regard, 

25 specifically binds means that the TM binds to a nociceptive sensory afferent (e.g. 
a primary nociceptive afferent) with a greater affinity than it binds to other 
neurons such as non-nociceptive afferents, and/ or to motor neurons (i.e. the 
natural target for clostridial neurotoxin holotoxin). The term "specifically binding" 
can also mean that a given TM binds to a given receptor, for example the ORI_i 

30 receptor, with a binding affinity (Ka) of 10 6 M' 1 or greater, preferably 10 7 M" 1 or 
greater, more preferably 10 8 M" 1 or greater, and most preferably, 10 9 M" 1 or 
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For the purposes of this invention, an agonist is defined as a molecule that is 
capable of stimulating the process of exocytic fusion in a target cell, which 
process-is susceptible to inhibition by a protease (or fragment thereof) capable of 
5 cleaving a protein of the exocytic fusion apparatus in said target cell. 

Accordingly, the particular agonist definition of the present invention would 
exclude many molecules that would be conventionally considered as agonists. 

10 For example, nerve growth factor (NGF) is an agonist in respect of its ability to 
promote neuronal differentiation via binding to a TrkA receptor. However, NGF is 
not an agonist when assessed by the above criteria because it is not a principal 
inducer of exocytic fusion. In addition, the process that NGF stimulates (i.e. cell 
differentiation) is not susceptible to inhibition by the protease activity of a non- 
15 cytotoxic toxin- molecule. 

The term "fragment", when used in relation to a protein, means a peptide having 
at least thirty-five, preferably at least twenty-five, more preferably at least twenty, 
and most preferably at least ten amino acid residues of the protein in question. 

20 

The term "variant", when used in relation to a protein, means a peptide or peptide 
fragment of the protein that contains one or more analogues of an amino acid 
(e.g. an unnatural amino acid), or a substituted linkage. 

25 The term "derivative", when used in relation to a protein, means a protein that 
comprises the protein in question, and a further peptide sequence. The further 
peptide sequence should preferably not interfere with the basic folding and thus 
conformational structure of the original protein. Two or more peptides (or 
fragments, or variants) may be joined together to form a derivative. Alternatively, 

30 a peptide (or fragment, or variant) may be joined to an unrelated molecule (e.g. a 
second, unrelated peptide). Derivatives may be chemically synthesized, but will 
be ' typically prepared by recombinant 1 nucleic acid methods. Additional *- ••*»:•*.-*.-. 
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components such as lipid, and/or polysaccharide, and/or polyketide components 
may be included. 

Throughout this specification, reference to the "ORLi receptor" embraces all 
5 members of the ORI^ receptor family. Members of the ORLi receptor family 
typically have a seven transmembrane domain structure and are coupled to G- 
proteins of the Gj and G 0 families. A method for determining the G-protein- 
stimulating activity of ligands of the ORL1 receptor is given in Example 12. A 
method for measuring reduction in cellular cAMP levels following ORU activation 
10 is given in Example 1 1 . A further characteristic of members of the ORL1 receptor 
family is that they are typically able to bind nociceptin (the natural ligand of 
ORL1). As an example, all alternative splice variants of the ORLt receptor, are 
members of the ORL1 receptor family. 

15 The term non-cytotoxic means that the protease molecule in question does not 
kill the target cell to which it has been re-targeted. 



The protease of the present invention embraces all naturally-occurring non- 
cytotoxic proteases that are capable of cleaving one or more proteins of the 
20 exocytic fusion apparatus in eukaryotic cells. 

The protease of the present invention is preferably a bacterial protease (or 
fragment thereof). More preferably the bacterial protease is selected from the 
genera Clostridium or Neisseria (e.g. a clostridial L-chain, or a neisserial IgA 
25 protease preferably from N. gonorrhoeae). 

The present invention also embraces modified non-cytotoxic proteases, which 
include amino acid sequences that do not occur in nature and/or synthetic amino 
acid residues, so long as the modified proteases still demonstrate the above- 
30 mentioned protease activity. 
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The protease of the present invention preferably demonstrates a serine or 
metalloprotease activity (e.g. endopeptidase activity). The protease is preferably 
specific for a SNARE protein (e.g. SNAP-25, synaptobrevinA/AMP, or syntaxin). 

5 Particular mention is made to the protease domains of neurotoxins, for example 
the protease domains of bacterial neurotoxins. Thus, the present invention 
embraces the use of neurotoxin domains, which occur in nature, as well as 
recombinantly prepared versions of said naturally-occurring neurotoxins. 

10 Exemplary neurotoxins are produced by Clostridia, and the term clostridial 
neurotoxin embraces neurotoxins produced by C. tetani (TeNT), and by C. 
botulinum (BoNT) serotypes A-G, as well as the closely related BoNT-like 
neurotoxins produced by C. baratii and C. butyricum. The above-mentioned 
abbreviations are used throughout the present specification. For example, the 

15 nomenclature BoNT/A denotes the source of neurotoxin as BoNT (serotype A). 
Corresponding nomenclature applies to other BoNT serotypes. 

The term L-chain fragment means a component of the L-chain of a neurotoxin, 
which fragment demonstrates a metalloprotease activity and is capable of 
20 proteolytically cleaving a vesicle and/or plasma membrane associated protein 
involved in cellular exocytosis. 

A Translocation Domain is a molecule that enables translocation of a protease (or 
fragment thereof) into a target cell such that a functional expression of protease 
25 activity occurs within the cytosol of the target cell. Whether any molecule (e.g. a 
protein or peptide) possesses the requisite translocation function of the present 
invention may be confirmed by any one of a number of conventional assays. 

For example, Shone C. (1987) describes an in vitro assay employing liposomes, 
30 which are challenged with a test molecule. Presence of the requisite 
translocation function is confirmed by release from the liposomes of K + and/or 
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labelled NAD, which may be readily monitored [see Shone C. (1987) Eur. J. 
Biochem; vol. 167(1): pp. 175-180]. 

A-further example is-provided by Blaustein R. (1987)r which describes a simple in 
5 vitro assay employing planar phospholipid bilayer membranes. The membranes 
are challenged with a test molecule and the requisite translocation function is 
confirmed by an increase in conductance across said membranes [see Blaustein 
(1987) FEBS Letts; vol. 226, no. 1: pp. 115-120]. 

10 Additional methodology to enable assessment of membrane fusion and thus 
identification of Translocation Domains suitable for use in the present invention 
are provided by Methods in Enzymology Vol 220 and 221, Membrane Fusion 
Techniques, Parts A and B, Academic Press 1993. 

15 The Translocation Domain is preferably capable of formation of ion-permeable 
pores in lipid membranes under conditions of low pH. Preferably it has been 
found to use only those portions of the protein molecule capable of pore- 
formation within the endosomal membrane. 

20 The Translocation Domain may be obtained from a microbial protein source, in 
particular from a bacterial or viral protein source. Hence, in one embodiment, the 
Translocation Domain is a translocating domain of an enzyme, such as a 
bacterial toxin or viral protein. 

25 It is well documented that certain domains of bacterial toxin molecules are 
capable of forming such pores. It is also known that certain translocation 
domains of virally expressed membrane fusion proteins are capable of forming 
such pores. Such domains may be employed in the present invention. 

30 The Translocation Domain may be of a clostridial origin, namely the Hn domain 
(or a functional component thereof). Hn means a portion or fragment of the H- 
chain of a clostridial neurotoxin -approximately equivalent to the amino-terminal 
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half of the H-chain, or the domain corresponding to that fragment in the intact Hi- 
cham. It is preferred that the H-chain substantially lacks the natural binding 
function of the H c component of the H-chain. In this regard, the H c function may 
be removed by deletion of the He amino acid sequence (either at the DNA 
5 synthesis level, or at the post-synthesis level by nuclease or protease treatment). 
Alternatively, the He function may be inactivated by chemical or biological 
treatment. Thus, the H-chain is preferably incapable of binding to the Binding 
Site on a target cell to which native clostridial neurotoxin (i.e. holotoxin) binds. 

10 In one embodiment, the translocation domain is a H N domain (or a fragment 
thereof) of a clostridial neurotoxin. Examples of suitable clostridial Translocation 
Domains include: 

Botulinum type A neurotoxin - amino acid residues (449-871 ) 
15 • Botulinum type B neurotoxin - amino acid residues (441-858) 



For further details on the genetic basis of toxin production in Clostridium 
botulinum and C. tetani, we refer to Henderson et al (1997) in The Clostridia: 
25 Molecular Biology and Pathogenesis, Academic press. 

The term H N embraces naturally-occurring neurotoxin H N portions, and modified 
Hn portions having amino acid sequences that do not occur in nature and/or 
synthetic amino acid residues, so long as the modified H N portions still 
30 demonstrate the above-mentioned translocation function. 



20 



Botulinum type C neurotoxin 
Botulinum type D neurotoxin 
Botulinum type E neurotoxin 
Botulinum type F neurotoxin 
Botulinum type G neurotoxin 
Tetanus neurotoxin 



amino acid residues (442-866) 
amino acid residues (446-862) 
amino acid residues (423-845) 
amino acid residues (440-864) 
amino acid residues (442-863) 
amino acid residues (458-879) 
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Alternatively, the Translocation Domain may be of a non-clostridial origin (see 
Table 4). Examples of non-clostridial Translocation Domain origins include, but 
not be restricted to, the translocation domain of diphtheria toxin [0=Keefe et al., 
Proe. Natlr-Acad. Sci. USA (-1992)- 89,-6202-6206; Silverman et al., J. Biol. Chem. 
5 (1993) 269, 22524-22532; and London, E. (1992) Biochem. Biophys. Acta., 1 1 12, 
pp.25-51], the translocation domain of Pseudomonas exotoxin type A [Prior et al. 
Biochemistry (1992) 31, 3555-3559], the translocation domains of anthrax toxin 
[Blanke et al. Proc. Natl. Acad. Sci. USA (1996) 93, 8437-8442], a variety of 
fusogenic or hydrophobic peptides of translocating function [Plank et al. J. Biol. 
10 Chem. (1994) 269, 12918-12924; and Wagner et al (1992) PNAS, 89, pp.7934- 
7938], and amphiphilic peptides [Murata et al (1992) Biochem., 31, pp.1 986- 
1992]. The Translocation Domain may mirror the Translocation Domain present 
in a naturally-occurring protein, or may include amino acid variations so long as 
the variations do not destroy the translocating ability of the Translocation Domain. 

15 

Particular examples of viral Translocation Domains suitable for use in the present 
invention include certain translocating domains of virally expressed membrane 
fusion proteins. For example, Wagner et al. (1992) and Murata et al. (1992) 
describe the translocation (i.e. membrane fusion and vesiculation) function of a 

20 number of fusogenic and amphiphilic peptides derived from the N-terminal region 
of influenza virus haemagglutinin. Other virally expressed membrane fusion 
proteins known to have the desired translocating activity are a translocating 
domain of a fusogenic peptide of Semliki Forest Virus (SFV), a translocating 
domain of vesicular stomatitis virus (VSV) glycoprotein G, a translocating domain 

25 of SER virus F protein and a translocating domain of Foamy virus envelope 
glycoprotein. Virally encoded Aspike proteins have particular application in the 
context of the present invention, for example, the E1 protein of SFV and the G 
protein of the G protein of VSV. 

30 Use of the Translocation Domains listed in Table (below) includes use of 
sequence variants thereof. A variant may comprise one or more conservative 
^nucleic -acid substitutions and/ or nucleic acid deletions or insertions, with the <*-'-• "-••>• ; - 
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proviso that the variant possesses the requisite translocating function. A variant 
may also comprise one or more amino acid substitutions and/ or amino acid 
deletions or insertions, so long as the variant possesses the requisite 
translocating function. 



Translocation 

domain source 


Amino AciH 

residues 


rvcici KSt ivco 


Diphtheria toxin 


194-380 


Silverman etal., 1994, J. Biol. 
Chem. 269, 22524-22532 
London E., 1992, Biochem. 
Biophys. Acta., 1113, 25-51 


Domain II of 

pseudomonas 

exotoxin 


405-613 


Prior etal., 1992, Biochemistry 
31, 3555-3559 

Kihara & Pastan, 1994, Bioconj 
Chem. 5, 532-538 


Influenza virus 
haemagglutinin 


GLFGAIAGFIENGWE 
GMIDGWYG, and 
Variants thereof 


Plank etal., 1994, J. Biol. Chem. 

269, 12918-12924 

Wagner etal., 1992, PNAS, 89, 

7934-7938 

Murata etal, 1992, Biochemistry 
31, 1986-1992 


Qomliki FnrAQt \/iritQ 

fusogenic protein 


Tranclripatirkn Hrnnoin 

1 1 dl loiULrCfUUI 1 UUIHdlll 


Kiolian o# al IQQfi 1 Poll Rinl 

134(4), 863-872 


Vesicular Stomatitis 
virus glycoprotein G 


118-139 


Yao etal., 2003, Virology 310(2), 
319-332 


SER virus F protein 


Translocation domain 


Seth etal., 2003, J Virol 77(1 1) 
6520-6527 


Foamy virus 

envelope 

glycoprotein 


Translocation domain 


Picard-Maureau et a/., 2003, J 
Virol. 77(8), 4722-4730 



; "j to -»T 
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Figures 

-Figure-1 - Purifieation-of-a-LG/A-nociceptin-H N /A-fusion protein 
5 Figure 2 Purification of a nociceptin-LC/A-HN/A fusion protein 
Figure 3 Purification of a LC/C-nociceptin-HN/C fusion protein 
Figure 4 Purification of a LC/A-met enkephalin-HN/A fusion protein 
Figure 5 Comparison of binding efficacy of a LC/A-nociceptin-H N /A fusion 
protein and a nociceptin-LC/A-HN/A fusion protein 
10 Figure 6 In vitro catalytic activity of a LC/A-nociceptin-HN/A fusion protein 
Figure 7 Purification of a LC/A-nociceptin variant-H N /A fusion protein 
Figure 8 Comparison of binding efficacy of a LC/A-nociceptin-HN/A fusion 

protein and a LC/A-nociceptin variant-HN/A fusion protein 
Figure 9 Expressed / purified LC/A-nociceptin-H N /A fusion protein family with 
15 variable spacer length product(s) 

Figure 10 Inhibition of SP release and cleavage of SNAP-25 by CPN-A 
Figure 1 1 Inhibition of SP release and cleavage of SNAP-25 over extended 

time periods after exposure of DRG to CPN-A 
Figure 12 Cleavage of SNAP-25 by CPNv-A 
20 Figure 13 Cleavage of SNAP-25 over extended time periods after exposure of 
DRG to CPNv-A 

Figure 14 CPNv-A fusion-mediated displacement of [3H]-nociceptin binding 
Figure 15 Expressed / purified CPNv(Ek)-A product 
Figure 16 Cleavage of SNAP-25 by CPNv(Ek)-A 
25 Figure 17 Expressed / purified CPNv-C product 
Figure 18 Cleavage of syntaxin by CPNv-C 

Figure 19 CPN-A efficacy in the Acute Capsaicin-lnduced Mechanical 
Allodynia model 

Figure 20 CPN-A efficacy in the Streptozotocin (STZ)-lnduced Peripheral 
30 Diabetic Neuropathy (Neuropathic Pain) model 

Figure 21 CPNv-A efficacy in the Acute Capsaicin-lnduced Mechanical 
•' Allodynia model •«••--• »■•• ■•• 
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Figure 22 Expressed / purified LC/A-CPLE-H N /A product 

Figure 23 Expressed / purified LC/A-CPBE-H N /A product 

Figure 24 Expressed / purified CPOP-A product 

•Figure-25 Expressed / purified CPOPv-A product 

5 Figure 26 In vitro SNAP-25 cleavage in a DRG cell model 

Figure 27 Expressed / purified CPNv-A-FXa-HT (removable his-tag) 

Figure 28 In vitro efficacy of LC/A-nociceptin-H N /A fusion proteins with 
variable spacer length, as assessed by ligand competition assay 

Figure 29 In vitro efficacy of LC/A-nociceptin-H N /A fusion proteins with 

10 variable spacer length, as assessed by in vitro SNAP-25 cleavage 



The Figures are now described in more detail. 



Figure 1 - Purification of a LC/A-nociceptin-H N /A fusion protein 

15 

Using the methodology outlined in Example 9, a LC/A-nociceptin-H N /A fusion 
protein was purified from E. coli BL21 cells. Briefly, the soluble products obtained 
following cell disruption were applied to a nickel-charged affinity capture column. 
Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa to 

20 activate the fusion protein and remove the maltose-binding protein (MBP) tag, 
then re-applied to a second nickel-charged affinity capture column. Samples 
from the purification procedure were assessed by SDS-PAGE (Panel A) and 
Western blotting (Panel B). Anti-nociceptin antisera (obtained from Abeam) were 
used as the primary antibody for Western blotting. The final purified material in 

25 the absence and presence of reducing agent is identified in the lanes marked [-] 
and [+] respectively. 



Figure 2 - Purification of a nociceptin-LC/A-H N /A fusion protein 



30 Using the methodology outlined in Example 9, a nociceptin-LC/A-H N /A fusion 
protein was purified from E. coli BL21 cells. Briefly, the soluble products obtained 
foltowingxell disruption were applied to a nickel-charged affinity capture column. 
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Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa to 
activate the fusion protein and remove the maltose-binding protein (MBP) tag, 
then re-applied to a second nickel-charged affinity capture column. Samples 
from the purification procedure- were -assessed by SDS-PAGE (Panel A) and 
5 Western blotting (Panel B). Anti-nociceptin antisera (obtained from Abeam) were 
used as the primary antibody for Western blotting. The final purified material in 
the absence and presence of reducing agent is identified in the lanes marked [-] 
and [+] respectively. 

10 Figure 3 - Purification of a LC/C-nociceptin-H N /C fusion protein 

Using the methodology outlined in Example 9, an LC/C-nociceptin-H N /C fusion 
protein was purified from E. coli BL21 cells. Briefly, the soluble products obtained 
following cell disruption were applied to a nickel-charged affinity capture column. 

15 Bound proteins were eluted with 100 mM imidazole, treated with Factor Xa to 
activate the fusion protein and remove the maltose-binding protein (MBP) tag, 
then re-applied to a second nickel-charged affinity capture column. Samples 
from the purification procedure were assessed by SDS-PAGE (Panel A) and 
Western blotting (Panel B). Anti-nociceptin antisera (obtained from Abeam) were 

20 used as the primary antibody for Western blotting. The final purified material in 
the absence and presence of reducing agent is identified in the lanes marked [-] 
and [+] respectively. 

Figure 4 - Purification of a LC/A-met enkephalin-H N /A fusion protein 

25 

Using the methodology outlined in Example 9, an LC/A-met enkephalin-H N /A 
fusion protein was purified from E. coli BL21 cells. Briefly, the soluble products 
obtained following cell disruption were applied to a nickel-charged affinity capture 
column. Bound proteins were eluted with 100 mM imidazole, treated with Factor 
30 Xa to activate the fusion protein and remove the maltose-binding protein (MBP) 
tag, then re-applied to a second nickel-charged affinity capture column. Samples 
" from the 1 purification.procedure were assessed by SDS-PAGE. The final purified 
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material in the absence and presence of reducing agent is identified in the lanes 
marked [-] and [+] respectively. 

Figure 5 - Comparison of binding efficacy of a LC/A-nociceptin-H N /A fusion 
5 protein and a nociceptin-LC/A-H N /A fusion protein 

The ability of nociceptin fusions to bind to the ORI_i receptor was assessed using 
a simple competition-based assay. Primary cultures of dorsal root ganglia (DRG) 
were exposed to varying concentrations of test material in the presence of 1 nM 
10 [3H]-nociceptin. The reduction in specific binding of the radiolabeled ligand was 
assessed by scintillation counting, and plotted in comparison to the efficacy of 
unlabelled ligand (Tocris nociceptin). It is clear that the LC/A-nociceptin-H N /A 
fusion is far superior to the nociceptin-LC/A-HN/A fusion at interacting with the 
ORLi receptor. 

15 

Figure 6 - In vitro catalytic activity of a LC/A-nociceptin-H N /A fusion protein 

The in vitro endopeptidase activity of the purified LC/A-nociceptin-H N /A fusion 
protein was determined essentially as described in Chaddock et al 2002, Prot. 
20 Express Purif. 25, 219-228. Briefly, SNAP-25 peptide immobilised to an ELISA 
plate was exposed to varying concentrations of fusion protein for 1 hour at 37°C. 
Following a series of washes, the amount of cleaved SNAP-25 peptide was 
quantified by reactivity with a specific antisera. 

25 Figure 7 - Purification of a LC/A-nociceptin variant-H N /A fusion protein 

Using the methodology outlined in Example 9, an LC/A-nociceptin variant-H N /A 
fusion protein was purified from E. coii BL21 cells. Briefly, the soluble products 
obtained following cell disruption were applied to a nickel-charged affinity capture 

30 column. Bound proteins were eluted with 100 mM imidazole, treated with Factor 
Xa to activate the fusion protein and remove the maltose-binding protein (MBP) 

' ^la^'fn^r^SppBia*)^ second nicke'PchaTged affinity Capture column. Samples-" ' »>• ->t 
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from the purification procedure were assessed by SDS-PAGE. The final purified 
material in the absence and presence of reducing agent is identified in the lanes 
marked [-] and [+] respectively. 

Figure 8 - Comparison of binding efficacy of a LC/A-nociceptin-H N /A fusion 
protein and a LC/A-nociceptin variant-H N /A fusion protein 

The ability of nociceptin fusions to bind to the ORLi receptor was assessed using 
a simple competition-based assay. Primary cultures of dorsal root ganglia (DRG) 
were exposed to varying concentrations of test material in the presence of 1nM 
[3H]-nociceptin. The reduction in specific binding of the radiolabelled ligand was 
assessed by scintillation counting, and plotted in comparison to the efficacy of 
unlabelled ligand (Tocris nociceptin). It is clear that the LC/A-nociceptin variant- 
HN/A fusion (CPNv-LHA) is superior to the LC/A-nociceptin variant-H N /A fusion 
(CPN-LHA) at interacting with the ORLi receptor. 

Figure 9 - Expressed / purified LC/A-nociceptin-H N /A fusion protein family 
with variable spacer length product(s) 

Using the methodology outlined in Example 9, variants of the LC/A-CPN-H N /A 
fusion consisting of GS10, GS30 and HX27 are purified from E. coli cell paste. 
Samples from the purification of LC/A-CPN(GS10)-Hn/A, LC/A-CPN(GS15)-Hn/A, 
LC/A-CPN(GS25)-H n /A, LC/A-CPN(GS30)-H N /A and LC/A-CPN(HX27)-H N /A 
were assessed by SDS-PAGE prior to staining with Coomassie Blue. The 
electrophoresis profile indicates purification of a disulphide-bonded di-chain 
species of the expected molecular mass of CPBE-A. Top panel: M = benchmark 
molecular mass markers; S = total E. coli protein soluble fraction; FT = proteins 
that did not bind to the Ni 2+ -charged Sepharose column; FUSION = fusion protein 
eluted by the addition of imidazole. Bottom panel: Lane 1 = benchmark 
molecular mass markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = 
purified material following initial capture on Ni 2+ -charged Sepharose; Lane 4 = 
Factor ^Xa treated material prior to final capture "on Nf* + -crrarged Sepharose; 
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Lane 5 = purified final material post activation with Factor Xa (5 pi); Lane 6 = 
purified final material post activation with Factor Xa (10 ul); Lane 7 = purified final 
material post activation with Factor Xa (20 pi); Lane 8 = purified final material 
-post-activatien-with-Facto^ post 
5 activation with Factor Xa + DTT (10 pi); Lane 10 = purified final material post 
activation with Factor Xa + DTT (20 pi). 

Figure 10 - Inhibition of SP release and cleavage of SNAP-25 by CPN-A 

10 Briefly, primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPN-A for 24 hours. Cellular proteins were separated by SDS- 
PAGE, Western blotted, and probed with anti-SNAP-25 to facilitate an 
assessment of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was 
calculated by densitometric analysis and plotted against fusion concentration 

15 (dashed line). Material was also recovered for an analysis of substance P 
content using a specific EIA kit. Inhibition of substance P release is illustrated by 
the solid line. The fusion concentration required to achieve 50% maximal 
SNAP-25 cleavage is estimated to be 6.30±2.48 nM. 

20 Figure 11- Inhibition of SP release and cleavage of SNAP-25 over extended 
time periods after exposure of DRG to CPN-A 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPN-A for 24 hours. Botulinum neurotoxin (BoNT/A) was used 

25 as a control. After this initial exposure, extracellular material was removed by 
washing, and the cells incubated at 37°C for varying periods of time. At specific 
time points, cellular proteins were separated by SDS-PAGE, Western blotted, 
and probed with anti-SNAP-25 to facilitate an assessment of SNAP-25 cleavage. 
The percentage of cleaved SNAP-25 was calculated by densitometric analysis 

30 and illustrated by the dotted lines. Material was also recovered for an analysis of 
substance P content using a specific EIA kit. Inhibition of substance P release is 
*4 -s . illustrated by the solid lines. - • , • — ... v 7. .v- 
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Figure 12 - Cleavage of SNAP-25 by CPNv-A 



Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
5 concentrations of CPNv-A for 24 hours. Cellular proteins were separated by 
SDS-PAGE, Western blotted, and probed with anti-SNAP-25 to facilitate an 
assessment of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was 
calculated by densitometric analysis. The fusion concentration required to 
achieve 50% maximal SNAP-25 cleavage is estimated to be 1 .38±0.36 nM. 

0 

Figure 13 - Cleavage of SNAP-25 over extended time periods after exposure 
of DRG to CPNv-A 



Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
15 concentrations of CPNv-A for 24 hours. CPN-A was used as a control. After this 
initial exposure, extracellular material was removed by washing, and the cells 
incubated at 37°C for varying periods of time. At specific time points, cellular 
proteins were separated by SDS-PAGE, Western blotted, and probed with anti- 
SNAP-25 to facilitate an assessment of SNAP-25 cleavage. The percentage of 
20 cleaved SNAP-25 was calculated by densitometric analysis. 



Figure 14 - CPNv-A fusion-mediated displacement of [3H]-nociceptin 
binding 



25 The ability of nociceptin fusions to bind to the ORU receptor was assessed using 
a simple competition-based assay. Primary cultures of dorsal root ganglia (DRG) 
were exposed to varying concentrations of test material in. the presence of 1 nM 
[3H]-nociceptin. The reduction in specific binding of the radiolabeled ligand was 
assessed by scintillation counting, and plotted in comparison to the efficacy of 

30 unlabelled ligand (Tocris nociceptin). It is clear that the LC/A-nociceptin variant- 
s/A fusion (labelled as CPNv-LHnA) is superior to the LC/A-nociceptin-Htvi/A 
9 •■■ ■*•-• fusion (labelled as CPN-LHnA) at interacting with the ORI_i receptor. v " 
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Figure 15 - Expressed / purified CPNv(Ek)-A product 

-Proteins-were subjected to SDS-PAGE prior to staining -with Coomassie Blue. 
5 The electrophoresis profile indicates purification of a disulphide-bonded di-chain 
species of the expected molecular mass of CPNv(Ek)-A. Lane 1 = benchmark 
molecular mass markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = 
purified material following initial capture on Ni 2+ -charged Sepharose; Lane 4 = 
purified final material post activation with enterokinase (5 pi); Lane 5 = purified 
10 final material post activation with enterokinase (10 ul); Lane 6 = purified final 
material post activation with enterokinase (20 ul); Lane 7 = purified final material 
post activation with enterokinase + DTT (5 pi); Lane 8 = purified final material 
post activation with enterokinase + DTT (10 ul); Lane 9 = purified final material 
post activation with enterokinase + DTT (20 pi). 

15 

Figure 16 - Cleavage of SNAP-25 by CPNv(Ek)-A 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPNv(Ek)-A for 24 hours. Cellular proteins were separated by 

20 SDS-PAGE, Western blotted, and probed with anti-SNAP-25 to facilitate an 
assessment of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was 
calculated by densitometry analysis. CPNv-A as prepared in Example 9 was 
used for comparison purposes. The percentage cleavage of SNAP-25 by 
CPNv(Ek)-A (labelled as En activated) and CPNv-A (labelled as Xa activated) are 

25 illustrated. 

Figure 17 - Expressed / purified CPNv-C product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. 
30 The electrophoresis profile indicates purification of a disulphide-bonded di-chain 
species of the expected molecular mass of CPNv-C. Lane 1 = benchmark 

- .. ,.-..■* > molecular mass*maFkers;^_ane£ = :tpta*'£/ coli protein soluble fraction; Lane 3- = - a ? -*;r 
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purified material following initial capture on Ni 2+ -charged Sepharose; Lane 4 = 
Factor Xa treated material prior to final capture on Ni 2+ -charged Sepharose; 
Lane 5 = purified material following second capture on Ni 2+ -charged Sepharose; 
-Lane 6 =-final purified material;- Lane 7 = final purified material + DTT; Lane 8 = 
benchmark molecular mass markers. 

Figure 18 - Cleavage of syntaxin by CPNv-C 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPNv-C for 24 hours. Cellular proteins were separated by 
SDS-PAGE, Western blotted, and probed with anti-syntaxin to facilitate an 
assessment of syntaxin cleavage. The percentage of cleaved syntaxin was 
calculated by densitometric analysis. The fusion concentration required to 
achieve 50% maximal syntaxin cleavage is estimated to be 3.13+1.96 nM. 

Figure 19 - CPN-A efficacy in the Acute Capsaicin-lnduced Mechanical 
Allodynia model 

The ability of an LC/A-nociceptin-H N /A fusion (CPN/A) to inhibit capsaicin-induced 
mechanical allodynia was evaluated following subcutaneous intraplantar injection 
in the rat hind paw. Test animals were evaluated for paw withdrawal frequency 
(PWF%) in response to a 10 g Von Frey filament stimulus series (10 stimuli x 3 
trials) prior to recruitment into the study (Pre-Treat); after subcutaneous 
intraplantar treatment with CPN/A but before capsaicin (Pre-CAP); and following 
capsaicin challenge post-injection of CPN/A (average of responses at 15* and 
30"; CAP). Capsaicin challenge was achieved by injection of 10 uL of a 0.3% 
solution. Sample dilutions were prepared in 0.5% BSA/saline. 



WO 2006/059093 Best Available Copy 



PCT/GB2005/004585 



41 

Figure 20 - CPN-A efficacy in the Streptozotocin (STZ)-lnduced Peripheral 
Diabetic Neuropathy (Neuropathic Pain) model 

-Male-Sprague-Dawley rats (250-300 g)-are-treated with 65-mg/kg-STZ in citrate 
5 buffer (I.V.) and blood glucose and lipid are measured weekly to define the 
readiness of the model. Paw Withdrawal Threshold (PWT) is measured in 
response to a Von Frey filament stimulus series over a period of time. Allodynia is 
said to be established when the PWT on two consecutive test dates (separated 
by 1 week) measures below 6 g on the scale. At this point, rats are randomized to 
10 either a saline group (negative efficacy control), gabapentin group (positive 
efficacy control) or a test group (CPN/A). Test materials (20-25 pi) are injected 
subcutaneously as a single injection (except gabapentin) and the PWT is 
measured at 1 day post-treatment and periodically thereafter over a 2 week 
period. Gabapentin (30 mg/kg i.p. @ 3 ml/kg injection volume) is injected daily, 
15 2 hours prior to the start of PWT testing. 

Figure 21 - CPNv-A efficacy in the Acute Capsaicin-lnduced Mechanical 
Allodynia model 

20 The ability of an LC/A-nociceptin variant-H N /A fusion (CPNv/A) to inhibit 
capsaicin-induced mechanical allodynia was evaluated following subcutaneous 
intraplantar injection in the rat hind paw. Test animals were evaluated for paw 
withdrawal frequency (PWF%) in response to a 10 g Von Frey filament stimulus 
series (10 stimuli x 3 trials) prior to recruitment into the study (Pre-Treat), after 

25 subcutaneous intraplantar treatment with CPNv/A but before capsaicin (Pre- 
CAP), and following capsaicin challenge post-injection of CPNv/A (average of 
responses at 15' and 30'; CAP). Capsaicin challenge was achieved by injection 
of 10 pL of a 0.3% solution. Sample dilutions were prepared in 0.5% BSA/saline. 
These data are expressed as a normalized paw withdrawal frequency differential, 

30 in which the difference between the peak response (post-capsaicin) and the 
baseline response (pre-capsaicin) is expressed as a percentage. With this 

* * "^rialysisT'it" can be seen tftaf CPNV7A Is more P potenf than CPN/A since a lower'' 
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dose of CPNv/A is required to achieve similar analgesic effect to that seen with 
CPN/A. 

-Figure 22— Expressed /-purified LC/A-CPLE-hWA produet 

5 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. 
The electrophoresis profile indicates purification of a disulphide-bonded di-chain 
species of the expected molecular mass of CPLE-A. Lane 1 = benchmark 
molecular mass markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = 
10 purified material following initial capture on Ni 2+ -charged Sepharose; Lane 4 = 
Factor Xa treated material prior to final capture on Ni 2+ -charged Sepharose; 
Lane 5 = purified material following second capture on Ni 2+ -charged Sepharose; 
Lane 6 = final purified material; Lane 7 = final purified material + DTT. 

15 Figure 23 - Expressed / purified LC/A-CPBE-H N /A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. 
The electrophoresis profile indicates purification of a disulphide-bonded di-chain 
species of the expected molecular mass of CPBE-A. Lane 1 = total E. coli 

20 protein soluble fraction; Lane 2 = purified material following initial capture on Ni 2+ - 
charged Sepharose; Lane 3 = Factor Xa treated material prior to final capture on 
Ni 2+ -charged Sepharose; Lane 4 = purified final material post activation with 
Factor Xa (5 pi); Lane 5 = purified final material post activation with Factor Xa (10 
Ml); Lane 6 = purified final material post activation with Factor Xa (20 pi); Lane 7 = 

25 purified final material post activation with Factor Xa + DTT (5 pi); Lane 8 = 
purified final material post activation with Factor Xa + DTT (10 pi); Lane 9 = 
purified final material post activation with Factor Xa + DTT (20 pi); Lane 10 = 
benchmark molecular mass markers. 



30 



WO 2006/059093 



PCT/GB2005/004585 



43 

Figure 24 - Expressed / purified CPOP-A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. 
-The-eleetrophoresis profile indicates purification of a-disulphide-bonded di-chain 
5 species of the expected molecular mass of CPOP-A. Lane 1 = benchmark 
molecular mass markers; Lane 2 = purified material following initial capture on 
Ni 2+ -charged Sepharose; Lane 3 = Factor Xa treated material prior to final 
capture on Ni 2+ -charged Sepharose; Lane 4 = purified material following second 
capture on Ni 2+ -charged Sepharose; Lane 5 = purified final material post 
10 activation with Factor Xa (5 pi); Lane 6 = purified final material post activation 
with Factor Xa (10 pi); Lane 7 = purified final material post activation with Factor 
Xa (20 pi); Lane 8 = purified final material post activation with Factor Xa + DTT (5 
pi); Lane 9 = purified final material post activation with Factor Xa + DTT (10 pi); 
Lane 10 = purified final material post activation with Factor Xa + DTT (20 pi). 

15 

Figure 25 - Expressed / purified CPOPv-A product 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. 
The electrophoresis profile indicates purification of a disulphide-bonded di-chain 

20 species of the expected molecular mass of CPOPv-A. Lane 1 = benchmark 
molecular mass markers; Lane 2 = total E. coli protein soluble fraction; Lane 3 = 
purified material following initial capture on Ni 2+ -charged Sepharose; Lane 4 = 
Factor Xa treated material prior to final capture on Ni 2+ -charged Sepharose; 
Lane 5 = purified final material post activation with Factor Xa (5 pi); Lane 6 = 

25 purified final material post activation with Factor Xa (10 pi); Lane 7 = purified final 
material post activation with Factor Xa (20 pi); Lane 8 = purified final material 
post activation with Factor Xa + DTT (5 pi); Lane 9 = purified final material post 
activation with Factor Xa + DTT (10 pi); Lane 10 = purified final material post 
activation with Factor Xa + DTT (20 pi). 

30 
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Figure 26 - In vitro SNAP-25 cleavage in a DRG cell model 

Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
-eoneentrations- of-GP0Pv'A-for-24-hoursT-Gellular proteins were separated by 
5 SDS-PAGE, Western blotted, and probed with anti-SNAP-25 to facilitate an 
assessment of SNAP-25 cleavage. The percentage of cleaved SNAP-25 was 
calculated by densitometric analysis. 

Figure 27 - Expressed / purified CPNv-A-FXa-HT (removable his-tag) 

10 

Proteins were subjected to SDS-PAGE prior to staining with Coomassie Blue. 
The electrophoresis profile indicates purification of a disulphide-bonded di-chain 
species of the expected molecular mass of CPNv-A-FXa-HT. Lane 1 = 
benchmark molecular mass markers; Lane 2 = total E. coli protein soluble 
15 —fraction; Lane 3 = Factor ~Xa treated material prior to final capture on Ni 2+ -charged 
Sepharose; Lane 4 = purified final material post activation with Factor Xa; Lane 5 
= purified final material post activation with Factor Xa + DTT. 

Figure 28 - In vitro efficacy of LC/A-nociceptin-H N /A fusion proteins with 
20 variable spacer length, as assessed by ligand competition assay 

The ability of LC/A-nociceptin-hVA fusions of variable spacer length to bind to 
the ORLi receptor was assessed using a simple competition-based assay. 
Primary cultures of dorsal root ganglia (DRG) were exposed to varying 

25 concentrations of test material in the presence of 1 nM [3H]-nociceptin. The 
reduction in specific binding of the radiolabeled ligand was assessed by 
scintillation counting, and plotted in comparison to the efficacy of unlabelled 
ligand (Tocris nociceptin). The upper panel illustrates the displacement 
characteristics of the GS0, GS20, GS30 and Hx27 spacers, whilst the lower 

30 panel illustrates the displacement achieved by the GS10, GS15 and GS25 
spaced fusion proteins. It is concluded that the GS0 and GS30 spacers are 
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ineffective, and the GS10 is poorly effective, at displacing nociceptin from the 
ORL1 receptor. 



Figure-29-'-/n-Wfro- efficacy of- LG/A-noclceptin-H N /A fusion proteins with 
5 variable spacer length, as assessed by in vitro SNAP-25 cleavage 



Primary cultures of dorsal root ganglia (DRG) were exposed to varying 
concentrations of CPN-A (of variable spacer length) for 24 hours. Cellular 
proteins were separated by SDS-PAGE, Western blotted, and probed with anti- 

10 SNAP-25 to facilitate an assessment of SNAP-25 cleavage. The percentage of 
cleaved SNAP-25 was calculated by densitometric analysis. The poorly effective 
binding characteristics of the GS10 spaced fusion protein (see Figure 28) are 
reflected in the higher concentrations of fusion required to achieve cleavage of 
intracellular SNAP-25. GS0 and GS30 spaced fusion proteins were completely 

15 ineffective (date not shown). GS15, 20 and 25 spaced fusion proteins were 
similarly effective. 



SEQ ID NOs 



20 SEQ ID1 DNA sequence of the LC/A 

SEQ ID2 DNA sequence of the hWA 

SEQ I D3 DNA sequence of the LC/B 

SEQ ID4 DNA sequence of the H N /B 

SEQ I D5 DNA sequence of the LC/C 

25 SEQID6 DNA sequence of the H N /C 

SEQ ID7 DNA sequence of the CPN-A linker 

SEQ ID8 DNA sequence of the A linker 

SEQ ID9 DNA sequence of the N-terminal presentation nociceptin insert 

SEQ ID10 DNA sequence of the CPN-C linker 

30 SEQ ID1 1 DNA sequence of the CPBE-A linker 

SEQ ID12 DNA sequence of the CPNvar-A linker 

SEQ ID1 3 DNA sequence of the LC/A-CPN-H N /A fusion 
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SEQ ID14 Protein sequence of the LC/A-CPN-Hn/A fusion 

SEQ ID1 5 DNA sequence of the N-LC/A-H N /A fusion 

SEQ ID1 6 Protein sequence of the N-LC/A-H N /A fusion 

-SEQ-ID-17 DNA sequence of the LC/C-CPN-Hn/C fusion • - 

5 SEQ ID18 Protein sequence of the LC/C-CPN-H N /C fusion 

SEQ ID1 9 DNA sequence of the LC/C-CPN-H N /C (A-linker) fusion 

SEQ ID20 Protein sequence of the LC/C-CPN-H N /C (A-linker) fusion 

SEQ ID21 DNA sequence of the LC/A-CPME-H N /A fusion 

SEQ ID22 Protein sequence of the LC/A-CPME-H N /A fusion 

10 SEQ ID23 DNA sequence of the LC/A-CPBE-H N /A fusion 

SEQ ID24 Protein sequence of the LC/A-CPBE-H N /A fusion 

SEQ ID25 DNA sequence of the LC/A-CPNv-H N /A fusion 

SEQ ID26 Protein sequence of the LC/A-CPNv-H N /A fusion 

SEQ ID27 DNA sequence of the LC/A-CPN[1 -1 1 J-HN/A fusion 

15 SEQ ID28 Protein sequence of the LC/A-CPN[1-1 1]-HN/A fusion 

SEQ ID29 DNA sequence of the LC/A-CPN[[Y10]1 -11 J-HN/A fusion 

SEQ ID30 Protein sequence of the LC/A-CPN[[Y10]1-1 1]-HN/A fusion 

SEQ ID31 DNA sequence of the LC/A-CPN[[Y1 1J1-1 1]-HN/A fusion 

SEQ ID32 Protein sequence of the LC/A-CPN[[Y1 1J1-1 1 J-HN/A fusion 

20 SEQ ID33 DNA sequence of the LC/A-CPN[[Y14J1-17]-HN/A fusion 

SEQ ID34 Protein sequence of the LC/A-CPN[[Y14]1-1 7J-HN/A fusion 

SEQ ID35 DNA sequence of the LC/A-CPN[1 -1 3J-HN/A fusion 

SEQ ID36 Protein sequence of the LC/A- CPN[1 -1 3J-HN/A fusion 

SEQ ID37 DNA sequence of CPN[1 -1 7] 

25 SEQ ID38 Protein Sequence of CPN[1-17] 

SEQ ID39 DNA sequence of CPN[1-1 1] 

SEQ ID40 Protein sequence of CPN[1-1 1] 

SEQ ID41 DNA sequence of CPN[[Y10]1-11J 

SEQ ID42 Protein sequence of CPN[[Y10]1-1 1] 

30 SEQ ID43 DNA sequence of CPN[[Y1 1J1-1 1J 

SEQ ID44 Protein sequence of CPN[[Y11]1-1 1J 

r ^ SEQ ID45- DNA sequence 6f CPN[[Y14]1-17] ~ -~*» 
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SEQ ID46 


Protein sequence of CPN[[Y14)1-17] 




SEQ ID47 


DNA sequence of CPN[1-13] 




SEQ ID48 


Protein sequence of CPN[1-13] 




— SEQ-ID49 


- DNA-sequenee -of GPNv-(also-known-as-N[[R14K-1 5] 1 -1-7-]) - 


5 


SEQ ID50 


Protein sequence of CPNv (also known as N[[R14K15]1-17]) 




SEQ ID51 


DNA sequence of the nociceptin-spacer-LC/A-HN/A fusion 




SEQ ID52 


Protein sequence of the nociceptin-spacer-LC/A-HN/A fusion 




SEQ ID53 


DNA sequence of the CPN-A GS10 linker 




SEQ ID54 


DNA sequence of the CPN-A GS15 linker 


10 


SEQ ID55 


DNA sequence of the CPN-A GS25 linker 




SEQ ID56 


DNA sequence of the CPN-A GS30 linker 




SEQ ID57 


DNA sequence of the CPN-A HX27 linker 




SEQ ID58 


DNA sequence of the LC/A-CPN(GS15)-H N /A fusion 




SEQ ID59 


Protein sequence of the LC/A-CPN(GS15)-H N /A fusion 


15 


SEQ ID60 


DNA sequence of the LC/A-CPN(GS25)-Hn/A fusion 




SEQ ID61 


Protein sequence of the LC/A-CPN(GS25)-H N /A fusion 




SEQ ID62 


DNA sequence of the CPNvar-A Enterokinase activatable linker 




SEQ ID63 


DNA sequence of the LC/A-CPNv(Ek)-H N /A fusion 




SEQ ID64 


Protein sequence of the LC/A-CPNv(Ek)-H N /A fusion 


20 


SEQ ID65 


DNA sequence of the CPNvar-A linker 




SEQ ID66 


DNA sequence of the LC/C-CPNv-H N /C fusion (act. A) 




SEQ ID67 


Protein sequence of the LC/C-CPNv-Hn/C fusion (act. A) 




SEQ ID68 


DNA sequence of the LC/A-CPLE-H N /A fusion 




SEQ ID69 


Protein sequence of the LC/A-CPLE-H N /A fusion 


25 


SEQ ID70 


DNA sequence of the LC/A-CPOP-H N /A fusion 




SEQ ID71 


Protein sequence of the LC/A-CPOP-H N /A fusion 




SEQ ID72 


DNA sequence of the LC/A-CPOPv-H N /A fusion 




SEQ ID73 


Protein sequence of the LC/A-CPOPv-H N /A fusion 




SEQ ID74 


DNA sequence of the IgA protease 


30 


SEQ ID75 


DNA sequence of the lgA-CPNv-H N /A fusion 




SEQ ID76 


Protein sequence of the lgA-CPNv-H N /A fusion 




--SECHD77 


**DNA~sequehce of the FXa-HT - ■ 
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DNA sequence of the CPNv-A-FXa-HT 
Protein sequence of the CPNv-A-FXa-HT fusion 
DNA sequence of the DT translocation domain 
DNA sequence of the CPLE-DT-A 
Protein sequence of the CPLE-DT-A fusion 
DNA sequence of the TeNT LC 
DNA sequence of the CPNv-TENT LC 
Protein sequence of the CPNV-TeNT LC fusion 
DNA sequence of the CPNvar-C linker 
10 SEQ ID87 DNA sequence of the LC/C-CPNv-H N /C fusion (act. C) 
Protein sequence of the LC/C-CPNv-Hn/C fusion (act. C) 

Examples 



SEQ 


ID78 


SEQ 


ID79 


SEQ 


ID80 


SEQ 


ID81 


SEQ 


ID82 


SEQ 


ID83 


opn 
otu 




SEQ 


ID85 


SEQ 


ID86 


SEQ 


ID87 


SEQ 


ID88 



15 Example 1 - Preparation of a LC/A and H N /A backbone clones 



The following procedure creates the LC and Hn fragments for use as the 
component backbone for multidomain fusion expression. This example is based 
on preparation of a serotype A based clone (SEQ ID1 and SEQ ID2), though the 
20 procedures and methods are equally applicable to the other serotypes [illustrated 
by the sequence listing for serotype B (SEQ ID3 and SEQ ID4) and serotype C 
(SEQ ID5 and SEQ ID6)]. 



Preparation of cloning and expression vectors 

25 pCR 4 (Invitrogen) is the chosen standard cloning vector, selected due to the lack 
of restriction sequences within the vector and adjacent sequencing primer sites 
for easy construct confirmation. The expression vector is based on the pMAL 
(NEB) expression vector, which has the desired restriction sequences within the 
multiple cloning site in the correct orientation for construct insertion (SamHI-Sa/l- 

30 Psfl-H/'ndlll). A fragment of the expression vector has been removed to create a 
non-mobilisable plasmid and a variety of different fusion tags have been inserted 
v.-vo , * ito increase* purification options. - • • * ■ 
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Preparation of protease (e.g. LC/A) insert 

The LC/A (SEQ ID1 ) is created by one of two ways: 

The DNA sequence -is- designedly -back- translation of the LC/A amino acid 
5 sequence [obtained from freely available database sources such as GenBank 
(accession number P10845) or Swissprot (accession locus BXA1_CLOBO) using 
one of a variety of reverse translation software tools (for example EditSeq best E. 
coli reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 
(Entelechon)]. BamH\ISal\ recognition sequences are incorporated at the 5' and 

10 3' ends respectively of the sequence, maintaining the correct reading frame. The 
DNA sequence is screened (using software such as MapDraw, DNASTAR Inc.) 
for restriction enzyme cleavage sequences incorporated during the back 
translation. Any cleavage sequences that are found to be common to those 
required by the cloning system are removed manually from the proposed coding 

15 sequence ensuring common E. coli codon usage is maintained. E. coli codon 
usage is assessed by reference to software programs such as Graphical Codon 
Usage Analyser (Geneart), and the %GC content and codon usage ratio 
assessed by reference to published codon usage tables (for example GenBank 
Release 143, 13 September 2004). This optimised DNA sequence containing the 

20 LC/A open reading frame (ORF) is then commercially synthesized (for example 
by Entelechon, Geneart or Sigma-Genosys) and is provided in the pCR 4 vector. 

The alternative method is to use PCR amplification from an existing DNA 
sequence with BamH\ and Sal\ restriction enzyme sequences incorporated into 

25 the 5' and 3' PCR primers respectively. Complementary oligonucleotide primers 
are chemically synthesised by a supplier (for example MWG or Sigma-Genosys), 
so that each pair has the ability to hybridize to the opposite strands (3' ends 
pointing "towards" each other) flanking the stretch of Clostridium target DNA, one 
oligonucleotide for each of the two DNA strands. To generate a PCR product the 

30 pair of short oligonucleotide primers specific for the Clostridium DNA sequence 
are mixed with the Clostridium DNA template and other reaction components and 
, --. 4 r ** <--r ^placed -in -a ^machine (the 'PCR machine') that can change -the incubation- 
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temperature of the reaction tube automatically, cycling between approximately 
94°C (for denaturation), 55°C (for oligonucleotide annealing), and 72°C (for 
synthesis). Other reagents required for amplification of a PCR product include a 
DNA polymerase (such as Tag or Pfu polymerase), each of the four nucleotide 
5 dNTP building blocks of DNA in equimolar amounts (50-200 uM) and a buffer 
appropriate for the enzyme optimised for Mg 2+ concentration (0.5-5 mM). 

The amplification product is cloned into pCR 4 using either, TOPO TA cloning for 
Taq PCR products or Zero Blunt TOPO cloning for Pfu PCR products (both kits 
10 commercially available from Invitrogen). The resultant clone is checked by 
sequencing. Any additional restriction sequences which are not compatible with 
the cloning system are then removed using site directed mutagenesis [for 
example, using Quickchange (Stratagene Inc.)]. 

1 5 — Preparation of translocation (e.g. Hn) insert 

The Hn/A (SEQ ID2) is created by one of two ways: 

The DNA sequence is designed by back translation of the H N /A amino acid 
sequence [obtained from freely available database sources such as GenBank 
(accession number P10845) or Swissprot (accession locus BXA1_CLOBO)] 

20 using one of a variety of reverse translation software tools [for example EditSeq 
best E. coli reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 
(Entelechon)]. A Pstl restriction sequence added to the N-terminus and Xbal- 
stop codon-W/ndlll to the C-terminus ensuring the correct reading frame is 
maintained. The DNA sequence is screened (using software such as MapDraw, 

25 DNASTAR Inc.) for restriction enzyme cleavage sequences incorporated during 
the back translation. Any sequences that are found to be common to those 
required by the cloning system are removed manually from the proposed coding 
sequence ensuring common E. coli codon usage is maintained. E. coli codon 
usage is assessed by reference to software programs such as Graphical Codon 

30 Usage Analyser (Geneart), and the %GC content and codon usage ratio 
assessed by reference to published codon usage tables (for example GenBank 



WO 2006/059093 Best Available Copy 



PCT/GB2005/004585 



51 

Release 143, 13 September 2004). This optimised DNA sequence is then 
commercially synthesized (for example by Entelechon, Geneart or Sigma- 
Genosys) and is provided in the pCR 4 vector. 

5 The alternative method is to use PCR amplification from an existing DNA 
sequence with Psfl and Xbal-stop codon-tf/ndtll restriction enzyme sequences 
incorporated into the 5* and 3' PCR primers respectively. The PCR amplification 
is performed as described above. The PCR product is inserted into pCR 4 vector 
and checked by sequencing. Any additional restriction sequences which are not 

10 compatible with the cloning system are then removed using site directed 
mutagenesis [for example using Quickchange (Stratagene Inc.)]. 

Example 2 - Preparation of a LC/A-nociceptin-H N /A fusion protein 
(nociceptin is N-terminal of the H N -chain) 

15 

Preparation of linker-nociceptin-spacer insert 

The LC-Hn linker can be designed from first principle, using the existing sequence 
information for the linker as the template. For example, the serotype A linker (in 
this case defined as the inter-domain polypeptide region that exists between the 

20 cysteines of the disulphide bridge between LC and H N ) is 23 amino acids long 
and has the sequence VRGIITSKTKSLDKGYNKALNDL. Within this sequence, it 
is understood that proteolytic activation in nature leads to an Hn domain that has 
an N-terminus of the sequence ALNDL. This sequence information is freely 
available from available database sources such as GenBank (accession number 

25 P10845) or Swissprot (accession locus BXA1_CLOBO). Into this linker a Factor 
Xa site, nociceptin and spacer are incorporated; and using one of a variety of 
reverse translation software tools [for example EditSeq best E. coli reverse 
translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)], the DNA 
sequence encoding the linker-ligand-spacer region is determined. Restriction 

30 sites are then incorporated into the DNA sequence and can be arranged as 
eamHI-Sa/l-linker-protease site-nociceptin-A/hel-spacer-Spel-Psfl-Xbal-stop 
codon-H/ndlll (SECT IDT). It is. important 'to ensure the correct reading frame is * 
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maintained for the spacer, nociceptin and restriction sequences and that the Xbal 
sequence is not preceded by the bases, TC, which would result on DAM 
methylation. The DNA sequence is screened for restriction sequence 
incorporation: and any additional sequences- are removed manually from the 

5 remaining sequence ensuring common E. coli codon usage is maintained. E. coli 
codon usage is assessed by reference to software programs such as Graphical 
Codon Usage Analyser (Geneart), and the %GC content and codon usage ratio 
assessed by reference to published codon usage tables (for example, GenBank 
Release 143, 13 September 2004). This optimised DNA sequence is then 

10 commercially synthesized (for example by Entelechon, Geneart or Sigma- 
Genosys) and is provided in the pCR 4 vector. 

Preparation of the LC/A-nociceptin-Hi/A fusion 

In order to create the LC-linker-nociceptin-spacer-H N construct (SEQ ID13), the 
15 pCR 4 vector encoding the linker (SEQ ID7) is cleaved with SamHI + Sail 
restriction enzymes. This cleaved vector then serves as the recipient vector for 
insertion and ligation of the LC/A DNA (SEQ ID1) cleaved with BamH\ + Sa/I. 
The resulting plasmid DNA is then cleaved with Psfl + Xbal restriction enzymes 
and serves as the recipient vector for the insertion and ligation of the Hn/A DNA 
20 (SEQ ID2) cleaved with Psfl + Xbal. The final construct contains the LC-linker- 
nociceptin-spacer-HN ORF (SEQ ID13) for transfer into expression vectors for 
expression to result in a fusion protein of the sequence illustrated in SEQ ID14. 

Example 3 - Preparation of a nociceptin-LC/A-HN/A fusion protein 
25 (nociceptin is N-terminal of the LC-chain) 

The LC/A-Hn/A backbone is constructed as described in Example 2 using the 
synthesised A serotype linker with the addition of a Factor Xa site for activation, 
arranged as SamHI-Sa/l-linker-protease site-linker-Psfl-Xbal-stop codon-H/ndlll 
30 (SEQ ID8). The LC/A-H N /A backbone and the synthesised N-terminal 
presentation nociceptin insert (SEQ ID9) are cleaved with 8amHI + Hind\\\ 
... • >m-*r re'gtricfibfi enzyrtfels? §ei purified' arid ligated fog&hepfo" "'dreate a nociceptive -ffl 
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spacer-LC-linker-H N . The ORF (SEQ ID15) is then cut out using restriction 
enzymes Ava\ + Xba\ for transfer into expression vectors for expression to result 
in a fusion protein of the sequence illustrated in SEQ ID16. 

5 Example 4 - Preparation of a LC/C-nociceptin-H N /C fusion protein 

Following the methods used in Examples 1 and 2, the LC/C (SEQ ID5) and H N /C 
(SEQ ID6) are created and inserted into the C serotype linker arranged as 
BamHI-Sa/l-linker-protease site-nociceptin-A//?el-spacer-Spel-Psfl-Xoal-stop 
10 codon-H/'ndlll (SEQ ID10). The final construct contains the LC-linker-nociceptin- 
spacer-H N ORF (SEQ ID17) for expression as a protein of the sequence 
illustrated in SEQ ID18. 

Example 5 • Preparation of a LC/C-nociceptin-H N /C fusion protein with a 
15 serotype A activation sequence 

Following the methods used In Examples 1 and 2, the LC/C (SEQ ID5) and H N /C 
(SEQ ID6) are created and inserted into the A serotype linker arranged as 
BamHI-Sa/l-linker-protease site-nociceptin-Wfiel-spacer-Spel-Psfl-Xoal-stop 
20 codon-H/ndlll (SEQ ID7). The final construct contains the LC-linker-nociceptin- 
spacer-HN ORF (SEQ ID19) for expression as a protein of the sequence 
illustrated in SEQ ID20. 

Example 6 - Preparation of a LC/A-met enkephalin-H N /A fusion protein 

25 

Due to the small, five-amino acid, size of the met-enkephalin ligand the LC/A-met 
enkephalin-HN/A fusion is created by site directed mutagenesis [for example 
using Quickchange (Stratagene Inc.)] using the LC/A-nociceptin-Hfvi/A fusion 
(SEQ ID13) as a template. Oligonucleotides are designed encoding the YGGFM 

30 met-enkephalin peptide, ensuring standard E.coli codon usage is maintained and 
no additional restriction sites are incorporated, flanked by sequences 

■ A complimentary to" thelinker region of the tC/A^hociceptin-tWA ftasion (SEQ ID1 3) 
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either side on the nociceptin section. The SDM product is checked by 
sequencing and the final construct containing the LC-linker-met enkephalin- 
spacer-H N ORF (SEQ ID21) for expression as a protein of the sequence 
-illustrated in SEQ ID22. - 

5 

Example 7 - Preparation of a LC/A-p endorphin-H N /A fusion protein 

Following the methods used in Examples 1 and 2, the LC/A (SEQ ID1) and H N /A 
(SEQ ID2) are created and inserted into the A serotype B endorphin linker 
10 arranged as BamHI-Sa/l-linker-protease site-B endorphin-A/nel-spacer-Spel-Psfl- 
Xbal-stop codon-H/ndlll (SEQ ID1 1 ). The final construct contains the LC-linker-B 
endorphin-spacer-H N ORF (SEQ ID23) for expression as a protein of the 
sequence illustrated in SEQ ID24. 

15 Example 8 - Preparation of a LC/A-nociceptin variant-H N /A fusion protein 

Following the methods used in Examples 1 and 2, the LC/A (SEQ ID1) and H N /A 
(SEQ ID2) are created and inserted into the A serotype nociceptin variant linker 
arranged as 8amHI-Sa/l-linker-protease site-nociceptin variant-A/nel-spacer- 
20 Spel-Psfl-Xbal-stop codon-H/ndlll (SEQ ID12). The final construct contains the 
LC-linker-nociceptin variant-spacer-H N ORF (SEQ ID25) for expression as a 
protein of the sequence illustrated in SEQ ID26. 

Example 9 - Purification method for LC/A-nociceptin-H N /A fusion protein 

25 

Defrost falcon tube containing 25 ml 50 mM HEPES pH 7.2, 200 mM NaCI and 
approximately 10 g of E. coli BL21 cell paste. Make the thawed cell paste up to 
80 ml with 50 mM HEPES pH 7.2, 200 mM NaCI and sonicate on ice 30 seconds 
on, 30 seconds off for 10 cycles at a power of 22 microns ensuring the sample 
30 remains cool. Spin the lysed cells at 18 000 rpm, 4°C for 30 minutes. Load the 
supernatant onto a 0.1 M NiSQ 4 charged Chelating column (20-30 ml column is 
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sufficient) equilibrated with 50 mM HEPES pH 7.2, 200 mM NaCI. Using a step 
gradient of 10 and 40 mM imidazol, wash away the non-specific bound protein 
and elute the fusion protein with 100 mM imidazol. Dialyse the eluted fusion 
protein against 5 L of 50 mM HEPES pH 7.2, 200 mM NaCI at 4°C overnight and 

5 measure the OD of the dialysed fusion protein. Add 1 unit of factor Xa per 100 
ug fusion protein and Incubate at 25°C static overnight. Load onto a 0.1 M NiS0 4 
charged Chelating column (20-30 ml column is sufficient) equilibrated with 50 mM 
HEPES pH 7.2, 200 mM NaCI. Wash column to baseline with 50 mM HEPES pH 
7.2, 200 mM NaCI. Using a step gradient of 10 and 40 mM imidazol, wash away 

10 the non-specific bound protein and elute the fusion protein with 100 mM imidazol. 
Dialyse the eluted fusion protein against 5 L of 50 mM HEPES pH 7.2, 200 mM 
NaCI at 4°C overnight and concentrate the fusion to about 2 mg/ml, aliquot 
sample and freeze at -20°C. Test purified protein using OD, BCA, purity analysis 
and SNAP-25 assessments. 

15 

Example 10 - Confirmation of TM Agonist Activity by measuring release of 
substance P from neuronal cell cultures 

Materials 

20 Substance P EIA is obtained from R&D Systems, UK. 
Methods 

Primary neuronal cultures of eDRG are established as described previously 
(Duggan et al., 2002). Substance P release from the cultures is assessed by 

25 EIA, essentially as described previously (Duggan et al., 2002). The TM of 
interest is added to the neuronal cultures (established for at least 2 weeks prior to 
treatment); control cultures are performed in parallel by addition of vehicle in 
place of TM. Stimulated (100 mM KCI) and basal release, together with total cell 
lysate content, of substance P are obtained for both control and TM treated 

30 cultures. Substance P immunoreactivity is measured using Substance P Enzyme 
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Immunoassay Kits (Cayman Chemical Company, USA or R&D Systems, UK) 
according to manufacturers' instructions. 

-The amount of Substance P released by the neuronal cells in the presence of the 
5 TM of interest is compared to the release obtained in the presence and absence 
of 100 mM KCI. Stimulation of Substance P release by the TM of interest above 
the basal release, establishes that the TM of interest is an "agonist ligand" as 
defined in this specification. If desired the stimulation of Substance P release by 
the TM of interest can be compared to a standard Substance P release-curve 
10 produced using the natural ORL-1 receptor ligand, nociceptin (Tocris). 

Example 11 - Confirmation of ORLi receptor activation by measuring 
forskolin-stimulated cAMP production 

15 -Confirmation- that-a- given TM is acting via-the ORI_i receptor is- provided by the 
following test, in which the TMs ability to inhibit forskolin-stimulated cAMP 
production is assessed. 

Materials 

20 [ 3 H]adenine and [ 14 C]cAMP are obtained from GE Healthcare 
Methods 

The test is conducted essentially as described previously by Meunier et al. 
[Isolation and structure of the endogenous agonist of opioid receptor-like ORLi 
25 receptor. Nature 377: 532-535, 1995] in intact transfected-CHO cells plated on 
24-well plastic plates. 

To the cells is added [3H]adenine (1 .0 uCi) in 0.4 ml of culture medium. The cells 
remain at 37°C for 2 h to allow the adenine to incorporate into the intracellular 
30 ATP pool. After 2 h, the cells are washed once with incubation buffer containing: 
130 mM NaCI, 4.8 mM KCI, 1.2 mM KH 2 P0 4 , 1.3 mM CaCI 2 , 1.2 mM MgS0 4 , 10 
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mM glucose, 1 mg/ml bovine serum albumin and 25 mM HEPES pH 7.4, and 
replaced with buffer containing forskolin (10 uM) and isobutylmethylxanthine (50 
uM) with or without the TM of interest. After 10 min, the medium is aspirated and 
- repteeed-with 0.5 ml, 9&-M-H6I.- -Approximately 1000 cpm of [ 14 C]cAMP is added 
5 to each well and used as an internal standard. The contents of the wells are then 
transferred to columns of 0.65 g dry alumina powder. The columns are eluted 
with 4 ml of 5 mM HCI, 0.5 ml of 0.1 M ammonium acetate, then two additional 
millilitres of ammonium acetate. The final eluate is collected into scintillation vials 
and counted for 14 C and tritium. Amounts collected are corrected for recovery of 
10 [ 14 C]cAMP. TMs that are agonists at the ORI_i receptor cause a reduction in the 
level of cAMP produced in response to forskolin. 

Example 12 - Confirmation of ORU receptor activation using a GTPyS 
binding functional assay 

15 

Confirmation that a given TM is acting via the ORLi receptor is also provided by 
the following test, a GTPyS binding functional assay. 

Materials 

20 [ 35 S]GTPyS is obtained from GE Healthcare 

Wheatgerm agglutinin-coated (SPA) beads are obtained from GE Healthcare 

Methods 

This assay is carried out essentially as described by Traynor and Nahorski 
25 [Modulation by p-opioid agonists of guanosine-5 -0-(3-[ 35 S]thio)triphosphate 
binding to membranes from human neuroblastoma SH-SY5Y cells. Mol. 
Pharmacol. 47: 848-854, 1 995]. 

Cells are scraped from tissue culture dishes into 20 mM HEPES, 1 mM 
30 ethylenediaminetetraacetic acid, then centrifuged at 500 * g for 10 min. Cells are 
resuspended in this buffer and homogenized with a Polytron Homogenizer. 
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The homogenate is centrifuged at 27,000 * g for 15 min, and the pellet 
resuspended in buffer A, containing: 20 mM HEPES, 10 mM MgCI 2 , 100 mM 
NaCI, pH 7.4. The suspension is recentrifuged at 20,000 * g and suspended 
once- more in- buffer A -For -the -binding assay, membranes (8-15 ug protein) are 
5 incubated with [ 35 S]GTP S (50 pM), GDP (10 uM), with and without the TM of 
interest, in a total volume of 1.0 ml, for 60 min at 25°C. Samples are filtered over 
glass fibre filters and counted as described for the binding assays. 

Example 13 - Preparation of a LC/A-nociceptln-HN/A fusion protein 
10 (nociceptin is N-terminal of the H N -chain) 

The linker-nociceptin-spacer insert is prepared as described in Example 2. 

Preparation of the LC/A-nociceptin-Hi^A fusion 

15 In order to create the LC-linker-nociceptin-spacer-H N construct (SEQ ID13), the 
pCR 4 vector encoding the linker (SEQ ID7) is cleaved with BamHI + Sa/I 
restriction enzymes. This cleaved vector then serves as the recipient for insertion 
and ligation of the LC/A DNA (SEQ ID1) also cleaved with BamHI + Sa/I. The 
resulting plasmid DNA is then cleaved with BamHI + HindUi restriction enzymes 

20 and the LC/A-linker fragment inserted into a similarly cleaved vector containing a 
unique multiple cloning site for BamHI, Sa/I, Psfl, and HindlU such as the pMAL 
vector (NEB). The H N /A DNA (SEQ ID2) is then cleaved with Psfl + Hind\\\ 
restriction enzymes and inserted into the similarly cleaved pMAL-LC/A-linker 
construct. The final construct contains the l_C-linker-nociceptin-spacer-H N ORF 

25 (SEQ ID1 3) for expression as a protein of the sequence illustrated in SEQ ID14. 

Example 14 - Preparation of a nociceptin-LC/A-H N /A fusion protein 
(nociceptin is N-terminal of the LC-chain) 

30 In order to create the nociceptin-spacer-LC/A-H N /A construct, an A serotype 
linker with the addition of a Factor Xa site for activation, arranged as BamH\-Sal\- 
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linker-protease site-linker-Psfl-Xbal-stop codon-H/ndlll (SEQ ID8) is synthesised 
as described in Example 13. The pCR 4 vector encoding the linker is cleaved 
with fiamHI + Sa/I restriction enzymes. This cleaved vector then serves as the 
recipient for insertion and- ligation of the-L-G/A DNA (SEQ ID1) also cleaved with 

5 BamHI + Sa/I. The resulting plasmid DNA is then cleaved with SamHI + Hind\\\ 
restriction enzymes and the LC/A-linker fragment inserted into a similarly cleaved 
vector containing the synthesised N-terminal presentation nociceptin insert (SEQ 
ID9). This construct is then cleaved with Aval + Hind\\\ and inserted into an 
expression vector such as the pMAL plasmid (NEB). The H N /A DNA (SEQ ID2) is 

10 then cleaved with Psfl + Hind\l\ restriction enzymes and inserted into the similarly 
cleaved pMAL-nociceptin-LC/A-linker construct. The final construct contains the 
nociceptin-spacer-LC/A-H N /A ORF (SEQ ID51) for expression as a protein of the 
sequence illustrated in SEQ ID52. 

15 Example 15 - Preparation and purification of an LC/A-nociceptin-H N /A 
fusion protein family with variable spacer length 

Using the same strategy as employed in Example 2, a range of DNA linkers were 
prepared that encoded nociceptin and variable spacer content. Using one of a 

20 variety of reverse translation software tools [for example EditSeq best E. coli 
reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)], 
the DNA sequence encoding the linker-ligand-spacer region is determined. 
Restriction sites are then incorporated into the DNA sequence and can be 
arranged as SamHI-Sa/l-linker-protease site-nociceptin-Miel-spacer-Spel-Psfl- 

25 Xbal-stop codon-H//?dlll (SEQ ID53 to SEQ ID57). It is important to ensure the 
correct reading frame is maintained for the spacer, nociceptin and restriction 
sequences and that the Xbal sequence is not preceded by the bases, TC which 
would result on DAM methylation. The DNA sequence is screened for restriction 
sequence incorporation and any additional sequences are removed manually 

30 from the remaining sequence ensuring common E. coli codon usage is 
maintained. E. coli codon usage is assessed by reference to software programs 
such as Graphical Codon Usage Analyser (Geneart), and the %GC content and 
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codon usage ratio assessed by reference to published codon usage tables (for 
example GenBank Release 143, 13 September 2004). This optimised DNA 
sequence is then commercially synthesized (for example by Entelechon, Geneart 
or Sigma-Genosys) and is provided in the pCR 4 vector. 

5 



The spacers that were created included: 



Code 


Protein sequence of the linker 


SEQ ID of the 
linker DNA 


GS10 


ALAGGGGSALVLQ 


53 


GS15 


ALAGGGGSGGGGSALVLQ 


54 


GS25 


ALAGGGGSGGGGSGGGGSGGGGSALVLQ 


55 


GS30 


ALAGGGGSGGGGSGGGGSGGGGSGGGGSALVLQ 


56 


HX27 


ALAAEAAAKEAAAKEAAAKAGGGGSALVLQ 


57 



Table 1 



By way of example, in order to create the LC/A-CPN(GS15)-Hn/A fusion construct 
10 (SEQ ID58), the pCR 4 vector encoding the linker (SEQ ID54) is cleaved with 
BamHI + Sa/I restriction enzymes. This cleaved vector then serves as the 
recipient vector for insertion and ligation of the LC/A DNA (SEQ ID1) also cleaved 
with BamHI + Sa/I. The resulting plasmid DNA is then cleaved with BamHI + 
HindlU restriction enzymes and the LC/A-linker fragment inserted into a similarly 
15 cleaved vector containing a unique multiple cloning site for BamHI, Sa/I, Psfl, and 
Hind\\\ such as the pMAL vector (NEB). The H N /A DNA (SEQ ID2) is then 
cleaved with Psfl + HindlU restriction enzymes and inserted into the similarly 
cleaved pMAL-LC/A-linker construct. The final construct contains the LC/A- 
CPN(GS15)-H N /A ORF (SEQ ID58) for expression as a protein of the sequence 
20 illustrated in SEQ ID59. 

As a further example, to create the LC/A-CPN(GS25)-H N /A fusion construct (SEQ 
ID60), the pCR 4 vector encoding the linker (SEQ ID55) is cleaved with BamHI + 
Sa/I restriction enzymes. This cleaved vector then serves as the recipient vector 
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for insertion and ligation of the LC/A DNA (SEQ ID1) cleaved with BamHI + Sail. 
The resulting plasmid DNA is then cleaved with BamHI + Hind\\\ restriction 
enzymes and the LC/A-linker fragment inserted into a similarly cleaved vector 
containing a unique multiple cloning site for BamHI, Sa/I, Pst\, and Hind\\\ such 
5 as the pMAL vector (NEB). The Hn/A DNA (SEQ ID2) is then cleaved with Pst\ + 
Hind\\\ restriction enzymes and inserted into the similarly cleaved pMAL-LC/A- 
linker construct. The final construct contains the LC/A-CPN(GS25)-H N /A ORF 
(SEQ ID60) for expression as a protein of the sequence illustrated in SEQ ID61 . 

10 Variants of the LC/A-CPN-Hn/A fusion consisting of GS10, GS30 and HX27 are 
similarly created. Using the purification methodology described in Example 9, 
fusion protein is purified from E. coli cell paste. Figure 9 illustrates the purified 
product obtained in the case of LC/A-CPN(GS10)-H N /A, LC/A-CPN(GS15)-Hn/A, 
LC/A-CPN(GS25)-H n /A, LC/A-CPN(GS30)-H N /A and LC/A-CPN(HX27)-Hn/A. 

15 — - • - ••■ 

Example 16 - Assessment of in vitro efficacy of an LC/A-nociceptin-H N /A 
fusion 

Fusion protein prepared according to Examples 2 and 9 was assessed in the 
20 eDRG neuronal cell model. 

Assays for the inhibition of substance P release and cleavage of SNAP-25 have 
been previously reported (Duggan et a/., 2002, J. Biol. Chem., 277, 34846- 
34852). Briefly, dorsal root ganglia neurons are harvested from 15-day-old fetal 

25 Sprague-Dawley rats and dissociated cells plated onto 24-well plates coated with 
Matrigel at a density of 1 x 10 6 cells/well. One day post-plating the cells are 
treated with 10 uM cytosine B-D-arabinofuranoside for 48 h. Cells are maintained 
in Dulbecco's minimal essential medium supplemented with 5% heat-inactivated 
fetal bovine serum, 5 mM L-glutamine, 0.6% D-glucose, 2% B27 supplement, and 

30 100 ng/ml 2.5S mouse nerve growth factor. Cultures are maintained for 2 weeks 
at 37°C in 95% air/5% CQ 2 before addition of test materials. 
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Release of substance P from eDRG is assessed by enzyme-linked 
immunosorbent assay. Briefly, eDRG cells are washed twice with low pptassium- 
-balanced salt solution (BSS: -5 mM KCI, 137 mM NaCI, 1.2 mM MgCfe, 5 mM 

5 glucose, 0.44 mM KH 2 P0 4 , 20 mM HEPES, pH 7.4, 2 mM CaCI 2 ). Basal 
samples are obtained by incubating each well for 5 min. with 1 ml of low 
potassium BSS. After removal of this buffer, the cells are stimulated to release 
by incubation with 1 ml of high potassium buffer (BSS as above with modification 
to include 100 mM KCI isotonically balanced with NaCI) for 5 min. All samples 

10 are removed to tubes on ice prior to assay of substance P. Total cell lysates are 
prepared by addition of 250 pi of 2 M acetic acid/0.1% trifluoroacetic acid to lyse 
the cells, centrifugal evaporation, and resuspension in 500 pi of assay buffer. 
Diluted samples are assessed for substance P content. Substance P 
immunoreactivity is measured using Substance P Enzyme Immunoassay Kits 

15 (Cayman Chemical Company or R&D Systems) according to manufacturers' 
instructions. Substance P is expressed in pg/ml relative to a standard substance 
P curve run in parallel. 

SDS-PAGE and Western blot analysis were performed using standard protocols 
20 (Novex). SNAP-25 proteins were resolved on a 12% Tris/glycine polyacrylamide 
gel (Novex) and subsequently transferred to nitrocellulose membrane. The 
membranes were probed with a monoclonal antibody (SMI-81) that recognises 
cleaved and intact SNAP-25. Specific binding was visualised using peroxidase- 
conjugated secondary antibodies and a chemiluminescent detection system. 
25 Cleavage of SNAP-25 was quantified by scanning densitometry (Molecular 
Dynamics Personal SI, ImageQuant data analysis software). Percent SNAP-25 
cleavage was calculated according to the formula: (Cleaved SNAP- 
25/(Cleaved+lntact SNAP-25))x100. 

30 Following exposure of eDRG neurons to an LC/A-nociceptin-HN/A fusion (termed 
CPN-A), both inhibition of substance P release and cleavage of SNAP-25 are 
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observed (Figure 10). After 24 h exposure to the fusion, 50% of maximal SNAP- 
25 cleavage is achieved by a fusion concentration of 6.3±2.5 nM. 

-The effect of- the fusion- is-also assessed-at defined time points following a 16 h 
5 exposure of eDRG to CPN-A. Figure 1 1 illustrates the prolonged duration of 
action of the CPN-A fusion protein, with measurable activity still being observed 
at 28 days post exposure. 

Example 17 - Assessment of in vitro efficacy of an LC/A-nociceptin variant- 
10 Hn/A fusion 

Fusion protein prepared according to Examples 8 and 9 was assessed in the 
eDRG neuronal cell mode using the method described in Example 16. 

15 Following exposure of eDRG neurons to an LC/A-nociceptin variant-H N /A fusion 
(termed CPNv-A), both inhibition of substance P release and cleavage of SNAP- 
25 are observed. After 24 h exposure to the fusion, 50% of maximal SNAP-25 
cleavage is achieved by a fusion concentration of 1.4±0.4 nM (Figure 12). 

20 The effect of the fusion is also assessed at defined time points following a 16 h 
exposure of eDRG to CPN-A. Figure 13 illustrates the prolonged duration of 
action of the CPN-A fusion protein, with measurable activity still being observed 
at 24 days post exposure. 

25 The binding capability of the CPNv-A fusion protein is also assessed in 
comparison to the CPN-A fusion. Figure 14 illustrates the results of a competition 
experiment to determine binding efficacy at the ORL-1 receptor. CPNv-A is 
demonstrated to displace [3H]-nociceptin, thereby confirming that access to the 
receptor is possible with the ligand in the central presentation format. 

30 
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Example 18 - Preparation of an LC/A-nociceptin variant-H N /A fusion protein 
that is activated by treatment with Enterokinase 

Following-the methods used in Examples 1 and 2, the LC/A (SEQ ID1) and H N /A 
5 (SEQ ID2) are created and inserted into the A serotype nociceptin variant linker 
arranged as SamHI-Sa/l-linker-enterokinase protease site-nociceptin variant- 
A/nel-spacer-Spel-Psfl-Xbal-stop codon-H/ndlll (SEQ ID62). The final construct 
contains the LC-linker-nociceptin variant-spacer-H N ORF sequences (SEQ ID63) 
for expression as a protein of the sequence illustrated in SEQ ID64. The fusion 
10 protein is termed CPNv(Ek)-A. Figure 15 illustrates the purification of CPNv(Ek)- 
A from E. coli following the methods used in Example 9 but using Enterokinase 
for activation at 0.00064 ug per 100 ug of fusion protein. 

Example 19 - Assessment of in vitro efficacy of a LC/A-nociceptin variant- 
1 5 H N /A fusion that has been activated by treatment with enterokinase 

The CPNv(Ek)-A prepared in Example 18 is obtained in a purified form and 
applied to the eDRG cell model to assess cleavage of SNAP-25 (using 
methodology from Example 16). Figure 16 illustrates the cleavage of SNAP-25 
20 following 24 h exposure of eDRG to CPNv(Ek)-A. The efficiency of cleavage is 
observed to be similar to that achieved with the Factor Xa-cleaved material, as 
recorded in Example 17. 

Example 20 - Preparation of an LC/C-nociceptin variant-H N /C fusion protein 
25 with a Factor Xa activation linker derived from serotype A 

Following the methods used in Example 4, the LC/C (SEQ ID5) and H N /C (SEQ 
ID6) are created and inserted into the A serotype nociceptin variant linker 
arranged as BamHI-Sa/l-linker-nociceptin variant-A/bel-spacer-Spel-Psfl-Xbal- 
30 stop codon-tf/ndlll (SEQ ID65). The final construct contains the LC-linker- 
nociceptin variant-spacer-H N ORF sequences (SEQ ID66) for expression as a 
protein of the sequence illustrated in SEQ ID67. The fusion protein is termed 
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CPNv-C (act. A). Figure 17 illustrates the purification of CPNv-C (act. A) from £ 
coli following the methods used in Example 9. 

Example 21 - Assessment of in v/fro-efficacy of an LG/C-nociceptin variant- 
5 H N /C fusion protein 

Following the methods used in Example 9, the CPNv-C (act. A) prepared in 
Example 20 is obtained in a purified form and applied to the eDRG cell model to 
assess cleavage of SNAP-25 (using methodology from Example 16). After 24 h 
10 exposure to the fusion, 50% of maximal syntaxin cleavage is achieved by a 
fusion concentration of 3.1±2.0 nM. Figure 18 illustrates the cleavage of syntaxin 
following 24 h exposure of eDRG to CPNv-C (act. A). 

Example 22 - Assessment of in vivo efficacy of an LC/A-nociceptin-HN/A 
15 -fusion 

The ability of an LC/A-nociceptin- H N /A fusion (CPN/A) to inhibit acute capsaicin- 
induced mechanical allodynia is evaluated following subcutaneous intraplantar 
injection in the rat hind paw. Test animals are evaluated for paw withdrawal 

20 frequency (PWF%) in response to a 10 g Von Frey filament stimulus series (10 
stimuli x 3 trials) prior to recruitment into the study, after subcutaneous treatment 
with CPN/A but before capsaicin, and following capsaicin challenge post-injection 
of CPN/A (average of responses at 15' and 30'). Capsaicin challenge is achieved 
by injection of 10 uL of a 0.3% solution. Sample dilutions are prepared in 0.5% 

25 BSA/saline. Figure 19 illustrates the reversal of mechanical allodynia that is 
achieved by pre-treatment of the animals with a range of concentrations of LC/A- 
nociceptin-HN/A fusion. 

The ability of an LC/A-nociceptin-HN/A fusion (CPN/A) to inhibit streptozotocin 
30 (STZ)- induced mechanical (tactile) allodynia in rats is evaluated. STZ-induced 
mechanical allodynia in rats is achieved by injection of streptozotocin (i.p. or i.v.) 
which yields destruction of 'p'ancreatid^-cens'TeaaingTo roSsWhsulin production, 
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with concomitant metabolic stress (hyperglycemia and hyperlipidemia). As such, 
STZ induces Type I diabetes. In addition, STZ treatment leads to progressive 
development of neuropathy, which serves as a model of chronic pain with 
hyperalgesia and allodynia that may reflect signs observed in diabetic humans 
5 (peripheral diabetic neuropathy). 

Male Sprague-Dawley rats (250-300 g) are treated with 65 mg/kg STZ in citrate 
buffer (I.V.) and blood glucose and lipid are measured weekly to define the 
readiness of the model. Paw Withdrawal Threshold (PWT) is measured in 

10 response to a Von Frey filament stimulus series over a period of time. Allodynia 
is said to be established when the PWT on two consecutive test dates (separated 
by 1 week) measures below 6 g on the scale. At this point, rats are randomized 
to either a saline group (negative efficacy control), gabapentin group (positive 
efficacy control) or a test group (CPN/A). Test materials (20-25 ul) are injected 

15 subcutaneously as a- single- injection (except -gabapentin) and the PWT is 
measured at 1 day post-treatment and periodically thereafter over a 2-week 
period. Gabapentin (30 mg/kg i.p. @ 3 ml/kg injection volume) is injected daily, 
2 hours prior to the start of PWT testing. Figure 20 illustrates the reversal of 
allodynia achieved by pre-treatment of the animals with 750 ng of CPN/A. Data 

20 were obtained over a 2-week period after a single injection of CPN/A 

Example 23 - Assessment of in vivo efficacy of an LC/A-nociceptin variant- 
s/A fusion 

25 The ability of an LC/A-nociceptin variant-H N /A fusion (CPNv/A) to inhibit 
capsaicin-induced mechanical allodynia is evaluated following subcutaneous 
intraplantar injection in the rat hind paw. Test animals are evaluated for paw 
withdrawal frequency (PWF%) in response to a 10 g Von Frey filament stimulus 
series (10 stimuli x 3 trials) prior to recruitment into the study (Pre-Treat); after 

30 subcutaneous intraplantar treatment with CPNv/A but before capsaicin (Pre- 
CAP); and following capsaicin challenge post-injection of CPNv/A (average of 
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responses at 15* and 30'; CAP). Capsaicin challenge is achieved by injection of 
10 uL of a 0.3% solution. Sample dilutions are prepared in 0.5% BSA/saline. 

-Figure-2Hllustfates-the-reversal-of-allQdynia-that is -achieved -by-pre-treatment of 
5 the animals with a range of concentrations of LC/A-nociceptin variant-H N /A fusion 
in comparison to the reversal achieved with the addition of LC/A-nociceptin-H N /A 
fusion. These data are expressed as a normalized paw withdrawal frequency 
differential, in which the difference between the peak response (post-capsaicin) 
and the baseline response (pre-capsaicin) is expressed as a percentage. With 
10 this analysis, it can be seen that CPNv/A is more potent than CPN/A since a 
lower dose of CPNv/A is required to achieve similar analgesic effect to that seen 
with CPN/A. 

Example 24 - Preparation of an LC/A-leu enkephalin-H N /A fusion protein 

15 

Due to the small, five-amino acid, size of the leu-enkephalin ligand the LC/A-leu 
enkephalin-H N /A fusion is created by site directed mutagenesis [for example 
using Quickchange (Stratagene Inc.)] using the LC/A-nociceptin-H N /A fusion 
(SEQ ID13) as a template. Oligonucleotides are designed encoding the YGGFL 

20 leu-enkephalin peptide, ensuring standard E. coli codon usage is maintained and 
no additional restriction sites are incorporated, flanked by sequences 
complimentary to the linker region of the LC/A-nociceptin-H N /A fusion (SEQ ID13) 
either side on the nociceptin section. The SDM product is checked by 
sequencing and the final construct containing the LC-linker-leu enkephalin- 

25 spacer-H N ORF (SEQ ID68) for expression as a protein of the sequence 
illustrated in SEQ ID69. The fusion protein is termed CPLE-A. Figure 22 
illustrates the purification of CPLE-A from E. coli following the methods used in 
Example 9. 

30 



WO 2006/059093 



PCT/GB2005/004585 



68 

Example 25 - Expression and purification of an LC/A-beta-endorphin-H N /A 
fusion protein 

-Following the methods- used in Example 9, and with the LC/A-beta-endorphin- 
5 H N /A fusion protein (termed CPBE-A) created in Example 7, the CPBE-A is 
purified from E. coli. Figure 23 illustrates the purified protein as analysed by 
SDS-PAGE. 

Example 26 - Preparation of an LC/A-nociceptin mutant-H N /A fusion protein 

10 

Due to the single amino acid modification necessary to mutate the nociceptin 
sequence at position 1 from a Phe to a Tyr, the LC/A-nociceptin mutant-H N /A 
fusion is created by site directed mutagenesis [for example using Quickchange 
(Stratagene Inc.)] using the LC/A-nociceptin-H N /A fusion (SEQ ID13) as a 

15 template. Oligonucleotides are designed encoding tyrosine at position 1 of the 
nociceptin sequence, ensuring standard E. coli codon usage is maintained and 
no additional restriction sites are incorporated, flanked by sequences 
complimentary to the linker region of the LC/A-nociceptin-H N /A fusion (SEQ ID13) 
either side on the nociceptin section. The SDM product is checked by 

20 sequencing and the final construct containing the LC/A-nociceptin mutant-spacer- 
Hn/A fusion ORF (SEQ ID70) for expression as a protein of the sequence 
illustrated in SEQ ID71. The fusion protein is termed CPOP-A. Figure 24 
illustrates the purification of CPOP-A from E. coli following the methods used in 
Example 9. 

25 

Example 27 - Preparation and assessment of an LC/A-nociceptin variant 
mutant-H N /A fusion protein 

Due to the single amino acid modification necessary to mutate the nociceptin 
30 sequence at position 1 from a Phe to a Tyr, the LC/A-nociceptin variant mutant- 
HN/A fusion is created by site directed mutagenesis [for example using 
" "' Quickchange (Stratagene Inc.)] using the LC/A-nociceptin variant-H N /A fusion 
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(SEQ ID25) as a template. Oligonucleotides are designed encoding tyrosine at 
position 1 of the nociceptin sequence, ensuring standard E. coli codon usage is 
maintained and no additional restriction sites are incorporated, flanked by 
sequences-complimentary to the linker-region of the LC/A-nociceptin variant-HN/A 

5 fusion (SEQ ID25) either side on the nociceptin section. The SDM product is 
checked by sequencing and the final construct containing the LC/A-nociceptin 
mutant-spacer-H N /A fusion ORF (SEQ ID72) for expression as a protein of the 
sequence illustrated in SEQ ID73. The fusion protein is termed CPOPv-A. 
Figure 25 illustrates the purification of CPOPv-A from E. coli following the 

10 methods used in Example 9. 

Using methodology described in Example 16, CPOPv-A is assessed for its ability 
to cleave SNAP-25 in the eDRG cell model. Figure 26 illustrates that CPOPv-A 
is able to cleave SNAP-25 in the eDRG model, achieving cleavage of 50% of the 
15 maximal SNAP-25 after exposure of the cells to approximately 5.9 nM fusion for 
24 h. 

Example 28 - Preparation of an IgA protease-nociceptin variant-HN/A fusion 
protein 

20 

The IgA protease amino acid sequence was obtained from freely available 
database sources such as GenBank (accession number P09790). Information 
regarding the structure of the N. Gonorrhoeae IgA protease gene is available in 
the literature (Pohlner et a/., Gene structure and extracellular secretion of 

25 Neisseria gonorrhoeae IgA protease, Nature, 1987, 325(6103), 458-62). Using 
Backtranslation tool v2.0 (Entelechon), the DNA sequence encoding the IgA 
protease modified for E. coli expression was determined. A SamHI recognition 
sequence was incorporated at the 5' end and a codon encoding a cysteine amino 
acid and Sail recognition sequence were incorporated at the 3' end of the IgA 

30 DNA. The DNA sequence was screened using MapDraw, (DNASTAR Inc.) for 
restriction enzyme cleavage sequences incorporated during the back translation. 
- ■ > Any -cleavage sequences that are found to be common to those required for - - 
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cloning were removed manually from the proposed coding sequence ensuring 
common E. coli codon usage is maintained. E. coli codon usage was assessed 
Graphical Codon Usage Analyser (Geneart), and the %GC content and codon 
usage ratio assessed -by reference to published codon usage- tables. This 
5 optimised DNA sequence (SEQ ID74) containing the IgA open reading frame 
(ORF) is then commercially synthesized. 

The IgA (SEQ ID74) is inserted into the LC-linker-nociceptin variant-spacer-H N 
ORF (SEQ ID25) using BamHI and Sa/I restriction enzymes to replace the LC 
10 with the IgA protease DNA. The final construct contains the IgA-linker-nociceptin 
variant-spacer-H N ORF (SEQ ID75) for expression as a protein of the sequence 
illustrated in SEQ ID76. 

Example 29 - Preparation and assessment of a nociceptin targeted 
15 -endopeptidase fusion protein with a removable histidine purification tag. 

DNA was prepared that encoded a Factor Xa removable his-tag (his6), although 
it is clear that alternative proteases site such as Enterokinase and alternative 
purification tags such as longer histidine tags are also possible. Using one of a 

20 variety of reverse translation software tools [for example EditSeq best E. coli 
reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)], 
the DNA sequence encoding the Factor Xa removable his-tag region is 
determined. Restriction sites are then incorporated into the DNA sequence and 
can be arranged as Nhe\-\\nker-Spe\-Pst\-Hi/A-Xba\-LEIEGRSGHHHHHHStop 

25 cocfon-H/ndlll (SEQ ID77). The DNA sequence is screened for restriction 
sequence incorporated and any additional sequences are removed manually 
from the remaining sequence ensuring common E. coli codon usage is 
maintained. E. coli codon usage is assessed by reference to software programs 
such as Graphical Codon Usage Analyser (Geneart), and the %GC content and 

30 codon usage ratio assessed by reference to published codon usage tables (for 
example GenBank Release 143, 13 September 2004). This optimised DNA 
"sequencers then 'commercially synthesized (fbr'exampfe b'fEnle'lgchon, Geneart 
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or Sigma-Genosys) and is provided in the pCR 4 vector. In order to create 
CPNv-A-FXa-HT (SEQ ID78, removable his-tag construct) the pCR 4 vector 
encoding the removable his-tag is cleaved with Nhe\ and Hind\\\. The Nhe\ - 
■Hindtol -fragment-is then-inserted-into-the-tC/A-CPNv-H N /A -vector (SEQID25) that 
5 has also been cleaved by Nhe\ and Hind\\\. The final construct contains the 
LC/A-linker-nociceptin variant-spacer-H N -FXa-Histag-H/nc/lll ORF sequences 
(SEQ ID78) for expression as a protein of the sequence illustrated in SEQ ID79. 
Figure 27 illustrates the purification of CPNv-A-FXa-HT from E. coli following the 
methods used in Example 9. 

10 

Example 30 - Preparation of a leu-enkephalin targeted endopeptidase fusion 
protein containing a translocation domain derived from diphtheria toxin 

The DNA sequence is designed by back translation of the amino acid sequence 

15 of the translocation domain of the diphtheria toxin (obtained from freely available 
database sources such as GenBank (accession number 1XDTT) using one of a 
variety of reverse translation software tools [for example EditSeq best E. coli 
reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)]. 
Restriction sites are then incorporated into the DNA sequence and can be 

20 arranged as A/ftel-Linker-Spel-Psfl- diphtheria translocation domain-Xoal-stop 
codon-H/ndlll (SEQ ID80). Pst\/Xba\ recognition sequences are incorporated at 
the 5' and 3' ends of the translocation domain respectively of the sequence 
maintaining the correct reading frame. The DNA sequence is screened (using 
software such as MapDraw, DNASTAR Inc.) for restriction enzyme cleavage 

25 sequences incorporated during the back translation. Any cleavage sequences 
that are found to be common to those required by the cloning system are 
removed manually from the proposed coding sequence ensuring common E. coli 
codon usage is maintained. E. coli codon usage is assessed by reference to 
software programs such as Graphical Codon Usage Analyser (Geneart), and the 

30 %GC content and codon usage ratio assessed by reference to published codon 
usage tables (for example GenBank Release 143, 13 September 2004). This 
optimised" DNA sequence cbri'tain'ihg 1 tFte diphtheria'- translocation domain is then 
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commercially synthesized as /Mel-Linker-Spel-Psfl- diphtheria translocation 
domain-Xbal-stop codon-H/ndlll (for example by Entelechon, Geneart or Sigma- 
Genosys) and is provided in the pCR 4 vector (Invitrogen). The pCR 4 vector 
encoding- the-diphtheria- translocation domain is cleaved with Nhe\ and Xbal. The 
5 Nhe\ - Xbal fragment is then inserted into the LC/A-CPLE-H N /A vector (SEQ 
ID68) that has also been cleaved by Nhe\ and Xbal. The final construct contains 
the LC/A-leu-enkephalin-spacer-diphtheria translocation domain ORF sequences 
(SEQ ID81)for expression as a protein of the sequence illustrated in SEQ ID82. 

10 Example 31 - Preparation of a nociceptin variant targeted endopeptidase 
fusion protein containing a LC domain derived from tetanus toxin. 

The DNA sequence is designed by back translation of the tetanus toxin LC amino 
acid sequence (obtained from freely available database sources such as 

15 -GenBank (accession-number -X04436) using - one of - a variety of reverse 
translation software tools [for example EditSeq best E. coli reverse translation 
(DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)]. BamH\ISal\ 
recognition sequences are incorporated at the 5' and 3' ends respectively of the 
sequence maintaining the correct reading frame (SEQ ID83). The DNA 

20 sequence is screened (using software such as MapDraw, DNASTAR Inc.) for 
restriction enzyme cleavage sequences incorporated during the back translation. 
Any cleavage sequences that are found to be common to those required by the 
cloning system are removed manually from the proposed coding sequence 
ensuring common E. coli codon usage is maintained. E. coli codon usage is 

25 assessed by reference to software programs such as Graphical Codon Usage 
Analyser (Geneart), and the %GC content and codon usage ratio assessed by 
reference to published codon usage tables (for example GenBank Release 143, 
13 September 2004). This optimised DNA sequence containing the tetanus toxin 
LC open reading frame (ORF) is then commercially synthesized (for example by 

30 Entelechon, Geneart or Sigma-Genosys) and is provided in the pCR 4 vector 
(invitrogen). The pCR 4 vector encoding the TeNT LC is cleaved with BamHI and 
: Salt The BamHI' * Sail fragment is then-inserted into the LC/A-CPNv-H N /A 
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vector (SEQ ID25) that has also been cleaved by BamHI and Sa/I. The final 
construct contains the TeNT LC-linker-nociceptin variant-spacer-H N ORF 
sequences (SEQ ID84) for expression as a protein of the sequence illustrated in 
-SEQID85r- 

5 

Example 32 - Preparation of an LC/C-nociceptin variant-H N /C fusion protein 
with a native serotype C linker that is susceptible to Factor Xa cleavage 

Following the methods used in Example 4, the LC/C (SEQ ID5) and H N /C (SEQ 
10 ID6) are created and inserted into the C serotype nociceptin variant linker 
arranged as BamHI-Sa/l-linker-nociceptin variant-A/hel-spacer-Spel-Psfl-Xoal- 
stop codon-H/ndlll (SEQ ID86). The final construct contains the LC-linker- 
nociceptin variant-spacer-H N ORF sequences (SEQ ID87) for expression as a 
protein of the sequence illustrated in SEQ ID88. The fusion protein is termed 
15 CPNv-C (act. C). 
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Claims: 



1 . A single chain, polypeptide fusion protein, comprising: 

a. a non-cytotoxic protease, or a fragment thereof, which 
protease or protease fragment is capable of cleaving a 
protein of the exocytic fusion apparatus of a nociceptive 
sensory afferent; 

b. a Targeting Moiety that is capable of binding to a Binding 
Site on the nociceptive sensory afferent, which Binding Site 
is capable of undergoing endocytosis to be incorporated into 
an endosome within the nociceptive sensory afferent; 

c. a protease cleavage site at which site the fusion protein is 
cleavable by a protease, wherein the protease cleavage site 
is located between the non-cytotoxic protease or fragment 
thereof and the Targeting Moiety; and 

d. a translocation domain that is capable of translocating the 
protease or protease fragment from within an endosome, 
across the endosomal membrane and into the cytosol of the 
nociceptive sensory afferent. 

2. The fusion protein according to Claim 1 , wherein the Targeting Moiety and 
the protease cleavage site are separated by at most 10 amino acid 
residues, preferably by at most 5 amino acid residues, and more 
preferably by at most zero amino acid residues. 

3. The fusion protein according to Claim 1 or Claim 2, wherein the Targeting 
Moiety is located between the protease cleavage site and the translocation 
domain. 



4. The fusion protein according to any preceding claim, wherein the non- 
> -i cytotoxic .protease is a clostridial neurotoxin L-chain or an IgA protease. 
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5. The fusion protein according to any preceding claim, wherein the 
translocation domain is the H N domain of a clostridial neurotoxin. 

6: The fusion protein according to any preceding claim, wherein the 
5 Targeting Moiety comprises at most 50 amino acid residues, preferably at 

most 40 amino acid residues, more preferably at least 30 amino acid 
residues, and most preferably at most 20 amino acid residues. 

7. The fusion protein according to any of Claims 1-6, wherein the Targeting 
10 Moiety is an opioid. 

8. The fusion protein according to any of Claim 1-6, wherein the Targeting 
Moiety is an agonist of a receptor present on a nociceptive sensory 
afferent. 

15 

9. The fusion protein according to Claim 8, wherein the Targeting Moiety is 
an agonist of a receptor present on a primary nociceptive sensory afferent. 

10. The fusion protein according to any of Claims 1-6, wherein the Targeting 
20 Moiety binds to the ORI_i receptor. 

11. The fusion protein according to Claim 10, wherein the Targeting Moiety 
binds specifically to the ORI_i receptor. 

25 12. The fusion protein according to Claim 10 or 11, wherein the Targeting 
Moiety is an agonist of the ORLi receptor. 

13. The fusion protein according to any one of Claims 10-12, wherein the 
Targeting Moiety has at least 70% homology to SEQ ID No. 38 or a 
30 fragment thereof. 
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14. The fusion protein according to Claim 13, wherein the Targeting Moiety as 
at least 80% homology to SEQ ID No. 38 or a fragment thereof. 

15. The fusion protein according to Claim 14, wherein the Targeting Moiety 
has at least 90% homology to SEQ ID No. 38 or a fragment thereof. 

16. The fusion protein according to Claim 15, wherein the Targeting Moiety 
has at least 95% homology to SEQ ID No. 38 or a fragment thereof. 

17. The fusion protein according to any one of Claims 10-12, wherein the 
Targeting Moiety is SEQ ID No. 38 or a fragment thereof. 

18. The fusion protein according to any of Claims 10-12, wherein the 
Targeting Moiety is one of SEQ ID Nos: 40, 42, 44, 46, 48 or 50. 

19. The fusion protein according to any one of Claims 10-12, wherein the 
Targeting Moiety is nociceptin. 

20. The fusion protein according to any of Claims 1-6, wherein the Targeting 
Moiety is selected from the group consisting of nociceptin, B-endorphin, 
endomorphine-1 , endomorphine-2, dynorphin, met-enkephalin, leu- 
enkephalin, galanin, and PAR-2 peptide. 

21. The fusion protein according to any preceding claim, wherein the fusion 
protein comprises a purification tag. 

22. The fusion protein according to Claim 21, wherein the fusion protein 
comprises a purification tag, which is present at the N-terminal and/ or C- 
terminal end of the fusion protein. 
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23. The fusion protein according to Claim 21 or 22, wherein the purification tag 
is joined to the fusion protein by a peptide spacer molecule. 

24. The fusion -protein according to any preceding claim, wherein the 
5 translocation domain is separated from the Targeting Moiety by a peptide 

spacer molecule. 

25. A polypeptide fusion protein comprising any one of SEQ ID NOs: 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 52, 59, 61, 64, 67, 69, 71, 73, 76, 79, 

10 82, 85, or 88. 

26. A nucleic acid sequence encoding the polypeptide fusion protein according 
to any preceding Claim. 

15 27. A nucleic acid sequence according to Claim 26, wherein the nucleic acid 
molecule comprises any one of SEQ ID NOs: 1-13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 58, 60, 63, 66, 68, 70, 72, 
75, 78, 81,84, or 87. 

20 28. A DNA vector, which comprises a promoter, a nucleic acid sequence 
according to Claim 26 or Claim 27, wherein said DNA sequence is located 
downstream of the promoter, and a terminator is located downstream of 
the DNA construct. 

25 29. The complementary DNA strand of the DNA sequence according to 
Claim 26 or Claim 27. 

30. A method for preparing a single-chain polypeptide fusion protein according 
to any of Claims 1-24, comprising expressing a nucleic acid sequence 
30 according to Claim 26 or Claim 27, or a DNA vector according to Claim 28, 

in a host cell. 
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31 . A method of preparing a non-cytotoxic agent, comprising: 

a. contacting a single-chain polypeptide fusion protein 
according to any of Claims 1-24 with a protease capable of 
cleaving the -protease cleavage site; 

5 b. cleaving the protease cleavage site; and thereby forming a 

di-chain fusion protein. 

32. A non-cytotoxic polypeptide, obtainable by the method of Claim 31, 
wherein the polypeptide is a di-chain polypeptide, and wherein: 

10 a. the first chain comprises the non-cytotoxic protease, or a 

fragment thereof, which protease or protease fragment is 
capable of cleaving a protein of the exocytic fusion apparatus 
of a nociceptive sensory afferent; 

b. the second chain comprises the TM and the translocation 
15 domain that is capable of translocating the protease or 

protease fragment from within an endosome, across the 
endosomal membrane and into the cytosol of the nociceptive 
sensory afferent; and 
the first and second chains are disulphide linked together. 

20 

33. Use of a fusion protein according to any of Claims 1-24 or a polypeptide 
according to Claim 32, for the manufacture of a medicament for treating, 
preventing or ameliorating pain. 

25 34. Use according to Claim 33, wherein the pain is chronic pain. 

35. A method of treating, preventing or ameliorating pain in a subject, 
comprising administering to said patient a therapeutically effective amount 
of a fusion protein according to any of Claims 1-24 or a polypeptide 

30 according to Claim 32. 

36. A method according to Claim 35, wherein the pain is chronic pain. 
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SEQUENCE LISTING 

<110> Health Protection Agency 
Allergan, Inc. 
Chaddock, John 
Foster, Keith 
Marks, Philip 
Stancombe, Patrick 
Aoki, K Roger 
Francis, Joseph 
Steward, Lance 

<120> Fusion Proteins 

<130> P26885WO/MRM 

<150> GB0426394.3 
<151> 2004-12-01 

<160> 88 

<170> Patentln version 3.3 

<210> 1 

<211> 1302 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 1 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag atatcgcatc 


caccctgaac 


900 
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aaagcgaaat 


ccatcgtggg taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga agacacctcc 


ggcaaattct 


ctgtagacaa gttgaaattc 


1020 


gataaacttt 


acaaaatgct gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt ttacaagctg 


ctgtgcgtcg 


ac 




1302 



<210> 2 

<211> 1257 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 2 



ctgcagtgta 


tcaaggttaa 


caactgggat 


ttattcttca 


gcccgagtga 


agacaacttc 


60 


accaacgacc 


tgaacaaagg 


tgaagaaatc 


acctcagata 


ctaacatcga 


agcagccgaa 


120 


gaaaacatct 


cgctggacct 


gatccagcag 


tactacctga 


cctttaattt 


cgacaacgag 


180 


ccggaaaaca 


tttctatcga 


aaacctgagc 


tctgatatca 


tcggccagct 


ggaactgatg 


240 


ccgaacatcg 


aacgtttccc 


aaacggtaaa 


aagtacgagc 


tggacaaata 


taccatgttc 


300 


cactacctgc 


gcgcgcagga 


atttgaacac 


ggcaaatccc 


gtatcgcact gactaactcc 


360 


gttaacgaag 


ctctgctcaa 


cccgtcccgt 


gtatacacct 


tcttctctag 


cgactacgtg 


420 


aaaaaggtca 


acaaagcgac 


tgaagctgca 


atgttcttgg 


gttgggttga 


acagcttgtt 


480 


tatgatttta 


ccgacgagac 


gtccgaagta 


tctactaccg 


acaaaattgc ggatatcact 


540 


atcatcatcc 


cgtacatcgg 


tccggctctg 


aacattggca 


acatgctgta 


caaagacgac 


600 


ttcgttggcg 


cactgatctt 


ctccggtgcg 


gtgatcctgc 


tggagttcat 


cccggaaatc 


660 


gccatcccgg 


tactgggcac 


ctttgctctg 


gtttcttaca 


ttgcaaacaa ggttctgact 


720 


gtacaaacca 


tcgacaacgc 


gctgagcaaa 


cgtaacgaaa 


aatgggatga 


agtttacaaa 


780 


tatatcgtga 


ccaactggct 


ggctaaggtt 


aatactcaga 


tcgacctcat 


ccgcaaaaaa 


840 


atgaaagaag 


cactggaaaa 


ccaggcggaa 


gctaccaagg 


caatcattaa 


ctaccagtac 


900 


aaccagtaca 


ccgaggaaga 


aaaaaacaac 


atcaacttca 


acatcgacga 


tctgtcctct 


960 


aaactgaacg 


aatccatcaa 


caaagctatg 


atcaacatca 


acaagttcct 


gaaccagtgc 


1020 


tctgtaagct 


atctgatgaa 


ctccatgatc 


ccgtacggtg 


ttaaacgtct ggaggacttc 


1080 
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gatgcgtctc tgaaagacgc cctgctgaaa tacatttacg acaaccgtgg cactctgatc 114 0 
ggtcaggttg atcgtctgaa ggacaaagtg aacaatacct tatcgaccga catccctttt 1200 
cagctcagta aatatgtcga taaccaacgc cttttgtcca ctctagacta gaagctt 1257 

<210> 3 

<211> 1323 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 3 



ggatccatgc 


cggttaccat 


caacaacttc 


aactacaacg 


acccgatcga 


caacaacaac 


60 


atcattatga 


tggaaccgcc 


gttcgcacgt 


ggtaccggac 


gttactacaa 


ggcttttaag 


120 


atcaccgacc 


gtatctggat 


catcccggaa 


cgttacacct 


tcggttacaa 


acctgaggac 


180 


ttcaacaaga 


gtagcgggat 


tttcaatcgt 


gacgtctgcg 


agtactatga 


tccagattat 


240 


ctgaatacca 


acgataagaa 


gaacatattc 


cttcagacta 


tgattaaact 


cttcaaccgt 


300 


atcaaaagca 


aaccgctcgg 


tgaaaaactc 


ctcgaaatga 


ttatcaacgg 


tatcccgtac 


360 


ctcggtgacc 


gtcgtgtccc 


gcttgaagag 


ttcaacacca 


acatcgcaag 


cgtcaccgtc 


420 


aacaaactca 


tcagcaaccc 


aggtgaagtc 


gaacgtaaaa 


aaggtatctt 


cgcaaacctc 


480 


atcatcttcg 


gtccgggtcc 


ggtcctcaac 


gaaaacgaaa 


ccatcgacat 


cggtatccag 


540 


aaccacttcg 


caagccgtga 


aggtttcggt 


ggtatcatgc 


agatgaaatt 


ctgcccggaa 


600 


tacgtcagtg 


tcttcaacaa 


cgtccaggaa 


aacaaaggtg 


caagcatctt 


caaccgtcgt 


660 


ggtLaCtuca 


ycgacccyyc 


aCUCaCCCCt 








720 


ctctacggta 


tcaaagttga 


cgacctcccg 


atcgtcccga 


acgagaagaa 


attcttcatg 


780 


cagagcaccg 


acgcaatcca 


ggctgaggaa 


ctctacacct 


tcggtggcca 


agacccaagt 


840 


atcataaccc 


cgtccaccga 


caaaagcatc 


tacgacaaag 


tcctccagaa 


cttcaggggt 


900 


atcgtggaca 


gactcaacaa 


agtcctcgtc 


tgcatcagcg 


acccgaacat 


caatatcaac 


960 


atatacaaga 


acaagttcaa 


agacaagtac 


aaattcgtcg 


aggacagcga 


aggcaaatac 


1020 


agcatcgacg 


tagaaagttt 


cgacaagctc 


tacaaaagcc 


tcatgttcgg 


tttcaccgaa 


1080 


accaacatcg 


ccgagaacta 


caagatcaag 


acaagggcaa 


gttacttcag 


cgacagcctc 


1140 


ccgcctgtca 


aaatcaagaa 


cctcttagac 


aacgagattt 


acacaattga 


agagggcttc 


1200 


aacatcagtg 


acaaagacat 


ggagaaggaa 


tacagaggtc 


agaacaaggc 


tatcaacaaa 


1260 


caggcatacg 


aggagatcag 


caaagaacac 


ctcgcagtct 


acaagatcca 


gatgtgcgtc 


1320 
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<210> 4 
<211> 1260 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic 










<4UU> 4 

ctgcagtgca 


tcgacgttga 


caacgaagac 


ctgttcttca 


tcgctgacaa aaacagcttc 


60 


agtgacgacc 


tgagcaaaaa 


cgaacgcacc 


gaatacaaca 


cccagagcaa ctacatcgaa 


120 


aacgacttcc 


cgatcaacga 


actgatcctg 


gacaccgacc 


tgataagtaa aatcgaactg 


180 


ccgagcgaaa 


acaccgaaag 


tctgaccgac 


ttcaacgttg 


acgttccggt ttacgaaaaa 


240 


cagccggcta 


tcaagaaaat 


cttcaccgac 


gaaaacacca 


tcttccagta cctgtacagc 


300 


cagaccttcc 


cgctggacat 


ccgtgacatc 


agtctgacca 


gcagtttcga cgacgctctg 


360 


ctgttcagca 


acaaagttta 


cagtttcttc 


agcatggact 


acatcaaaac cgctaacaaa 


420 


gttgttgaag 


cagggctgtt 


cgctggttgg 


gttaaacaga 


tcgttaacga cttcgttatc 


480 


gaagctaaca 


aaagcaacac 


tatggacaaa 


atcgctgaca 


tcagtctgat cgttccgtac 


540 


atcggtctgg 


ctctgaacgt 


tggtaacgaa 


accgctaaag 


gtaactttga aaacgctttc 


600 


gagatcgctg 


gtgcaagcat 


cctgctggag 


ttcatcccgg 


aactgctgat cccggttgtt 


660 


ggtgctttcc 


tgctggaaag 


ttacatcgac 


aacaaaaaca 


agatcatcaa aaccatcgac 


720 


aacgctctga 


ccaaacgtaa 


cgaaaaacgg 


agtgatatgt 


acggtctgat cgttgctcag 


780 


tggctgagca 


ccgtcaacac 


ccagucccac 


accatcaaag 


aaggtatgta caaagctctg 


840 


aactaccagg 


ctcaggctct 


ggaagaga u c 


atcaaatacc 


gttacaacat ctacagtgag 


900 


aaggaaaaga 


gtaacatcaa 


catcgac'ttc 


aacgacatca 


acagcaaact gaacgaaggt 


960 


atcaaccagg 


ctatcgacaa 


catcaacaac 


ttcatcaacg 


gttgcagtgt tagctacctg 


1020 


atgaagaaga 


tgatcccgct 


ggctgttgaa 




acttccracaa caccctcraaa 


1080 


aagaacctgc 


tgaactacat 


cgacgaaaac 


aagctgtacc 


tgatcggtag tgctgaatac 


1140 


gaaaaaagta 


aagtgaacaa 


atacctgaag 


accatcatgc 


cgttcgacct gagtatctac 


1200 


accaacgaca 


ccatcctgat 


cgaaatgttc 


aacaaataca 


actctctaga ctagaagctt 


1260 



<210> 5 

<211> 1329 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic 
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<400> 5 
ggatccgaat 


tcatgccgat 


caccatcaac aacttcaact acagcgatcc ggtggataac 


60 


aaaaacatcc 


tgtacctgga 


tacccatctg aataccctgg cgaacgaacc ggaaaaagcg 


120 


tttcgtatca 


ccggcaacat 


ttgggttatt ccggatcgtt ttagccgtaa cagcaacccg 


180 


aatctgaata 


aaccgccgcg 


tgttaccagc ccgaaaagcg gttattacga tccgaactat 


240 


ctgagcaccg 


atagcgataa 


agataccttc ctgaaagaaa tcatcaaact gttcaaacgc 


300 


atcaacagcc 


gtgaaattgg 


cgaagaactg atctatcgcc tgagcaccga tattccgttt 


360 


ccgggcaaca 


acaacacccc 


gatcaacacc tttgatttcg atgtggattt caacagcgtt 


420 


gatgttaaaa 


cccgccaggg 


taacaattgg gtgaaaaccg gcagcattaa cccgagcgtg 


480 


attattaccg 


gtccgcgcga 


aaacattatt gatccggaaa ccagcacctt taaactgacc 


540 


aacaacacct 


ttgcggcgca 


ggaaggtttt ggcgcgctga gcattattag cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc aacgatgttg gtgaaggccg tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc ctgatgcatg aactgaacca tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat cagaccatta gcagcgtgac cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa tatgcggaaa tctatgcgtt tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc aaatacttcg aagaaaaagc gctggattac 


900 


tatcgcagca 


ttgcgaaacg 


tctgaacagc attaccaccg cgaatccgag cagcttcaac 


960 


aaatatatcg 


gcgaatataa 


acagaaactg atccgcaaat atcgctttgt ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa ttcgtggaac tgtacaacga actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc tataacgtgc agaaccgtaa aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat attctggatg ataacgtgta cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg aacgttctgt ttatgggcca gaacctgagc 


1260 


cgtaatccgg 


cgctgcgtaa 


agtgaacccg gaaaacatgc tgtacctgtt caccaaattt 


• 1320 


tgcgtcgac 






1329 



<210> 6 

<211> 1263 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 6 

ctgcagtgtc gtgaactgct ggtgaaaaac accgatctgc cgtttattgg cgatatcagc 60 



.> gatgtgaaaa ccgatatctt cctgcgcaaa gatatcaacg aagaaaccga agtgatctac 



120 
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tacccggata 

cagctggatc 

caggtgtttt 

gaaagccaga 

gcgctggata 

gcgggtgttc 

accaacatcc 

tatattggtc 

tttgcggtta 

ctgggtgcgt 

gataactgcc 

acctggctga 

ctgaactatc 

ggcagcgata 

aaaattagcg 

ctgttcaaaa 

aaagcgaaac 

aaactgaaag 

tataccaaca 

ctt 



acgtgagcgt 

tgctgtatcc 

acgataaccg 

aactgagcga 

acagcgcgaa 

agggcggtct 

tgcgtaaaga 

cggcgctgaa 

ccggtgtgac 

ttgtgatcta 

tggaacagcg 

gccgtattat 

aggcgggtgc 

aagaaaacat 

aagcgatgaa 

acatgctgcc 

tgatcaacct 

cgaaagttaa 

acagcctgct 



tgatcaggtg 

gagcattgat 

tacccagaac 

taacgtggaa 

agtttacacc 

gtttctgatg 

taccctggat 

cattagcaat 

cattctgctg 

tagcaaagtg 

tattaaacgc 

cacccagttc 

gattaaagcg 

caaaagccag 

taacatcaac 

gaaagtgatc 

gatcgatagc 

caacagcttc 

gaaagatatc 



atcctgagca 

agcgaaagcg 

gtggattacc 

gattttacct 

tattttccga 

tgggcgaacg 

aaaatcagcg 

agcgtgcgtc 

gaagcgtttc 

caggaacgca 

tggaaagata 

aacaacatca 

aaaatcgatc 

gttgaaaacc 

aaattcatcc 

gatgaactga 

cacaacatta 

cagaacacca 

atcaacgaat 



aaaacaccag 

aaattctgcc 

tgaacagcta 

ttacccgcag 

ccctggcgaa 

atgtggtgga 

atgttagcgc 

gtggcaattt 

cggaatttac 

acgaaatcat 

gctatgaatg 

gctaccagat 

tggaatacaa 

tgaaaaacag 

gcgaatgcag 

acgaatttga 

ttctggtggg 

tcccgtttaa 

acttcaatct 



cgaacatggt 
gggcgaaaac 
ttactacctg 
cattgaagaa 
caaagttaat 
agatttcacc 
gattattccg 
taccgaagcg 
cattccggcg 
caaaaccatc 
gatgatgggc 
gtacgatagc 
aaaatacagc 
cctggatgtg 
cgtgacctac 
tcgcaacacc 
cgaagtggat 
catcttcagc 
agactagaag 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1263 



<210> 7 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 7 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat cgaaggtcgt 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 
acgcacggtc tagaatgata aaagctt 



60 
120 
180 
207 



<2iO>- Tl 8 : '~ ~ 
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<211> 108 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 8 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
aacaaagcgc tgaacctgca gacgcacggt ctagaatgat aaaagctt 108 



<210> 9 

<211> 186 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 9 

catatgaata acctcgggat tgagggtcgt tttggcggtt tcacgggcgc acgcaaatca 



60 



gcgcgtaaat tagctaacca gactagtggc ggtgggggta gtggcggtgg cggttcgggc 120 
gggggtggga gccctagggg atccgtcgac ctgcagggtc tagaagcgct agcgtgataa 180 
aagctt 186 

<210> 10 
<211> 180 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 10 

ggatccacgc acgtcgacgc gattgatggt cgttttggcg gtttcacggg cgcacgcaaa 60 
tcagcgcgta aattagctaa ccaggcgcta gcgggcggtg gcggtagcgg cggtggcggt 120 
agcggcggtg gcggtagcgc actagtgctg cagacgcacg gtctagaatg ataaaagctt 180 

<210> 11 
<211> 249 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 11 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat cgaaggtcgt 60 
tacggtggtt tcatgacctc tgaaaaatct cagac.cccgc tggttaccct gttcaaaaac 120 
gctatcatca aaaacgctta caaaaaaggt -gaagcgctag -cgggtggtgg tggttctggt 180 
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ggtggtggtt ctggtggtgg tggttctgca ctagtgctgc agacgcacgg tctagaatga 240 



taaaagctt 



249 



<210> 12 
<211> 207 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Synthetic 
<400> 12 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat cgaaggtcgt 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gtaagaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 



<210> 13 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 13 
ggatccatgg 


agttcgttaa 


caaacagttc aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg actagcatcg ttcgcggtat cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt ggtcacgaag ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg ggcgctggta aattcgcaac tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca ggccaccgcc tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac gcgtattacg agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt cacgacgcta aattcatcga ctctctgcaa 


840 
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gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc 


gacaacgcgc 


tgagcaaacg 


taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca 


ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc 


gaggaagaaa 


aaaacaacat 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca 


aagctatgat 


caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat 


ctgatgaact 


ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg 


aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat 


cgtctgaagg 


acaaagtgaa 


caatacctta 


2640 


tcgaccgaca - tccctttfcca 


gctcagtaaa: 


tatgtcgata 


accaacgcct 


tttgtccact 


2700 
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ctagactag 



2709 



^ni fix 


1 A 


<211> 


902 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


14 



Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 " 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 " * 185 190 
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Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 
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Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 " '440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He lie 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
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675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 7.45 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 ~ 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 15 

<211->- 2736- -iv— h ■„ ^ ~ 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 15 



ctcgggattg agggtcgttt 


tggcggtttc 


acgggcgcac gcaaatcagc 


^ ^ o ■fr* +•* a 

yCyuaaatta 


o u 


gctaaccaga ctagtggcgg 


tgggggtagt 


ggcggtggcg gttcgggcgg 


gggtgggagc 


ion 
LA U 


cctaggggat ccatggagtt 


cgttaacaaa 


cagttcaact ataaagaccc 


agttaacggt 


180 


gttgacattg cttacatcaa 


aatcccgaac 


gctggccaga tgcagccggt 


aaaggcattc 


240 


aaaatccaca acaaaatctg 


ggttatcccg 


gaacgtgata cctttactaa 


cccggaagaa 


300 


ggtgacctga acccgccacc 


ggaagcgaaa 


caggtgccgg tatcttacta 


tgactccacc 


360 


tacctgtcta ccgataacga 


aaaggacaac 


tacctgaaag gtgttactaa 


actgttcgag 


420 


cgtatttact ccaccgacct 


gggccgtatg 


ctgctgacta gcatcgttcg 


cggtatcccg 


480 


ttctggggcg gttctaccat 


cgataccgaa 


ctgaaagtaa tcgacactaa 


ctgcatcaac 


540 


gttattcagc cggacggttc 


ctatcgttcc 


gaagaactga acctggtgat 


catcggcccg 


600 


tctgctgata tcatccagtt 


cgagtgtaag 


agctttggtc acgaagttct 


gaacctcacc 


f /t r\ 

660 


cgtaacggct acggttccac 


tcagtacatc 


cgtttctctc cggacttcac 


cttcggtttt 


720 


gaagaatccc tggaagtaga 


cacgaaccca 


ctgctgggcg ctggtaaatt 


cgcaactgat 


780 


cctgcggtta ccctggctca 


cgaactgatt 


catgcaggcc accgcctgta 


cggtatcgcc 


840 


atcaatccga accgtgtctt 


caaagttaac 


accaacgcgt attacgagat 


gtccggtctg 


900 


gaagttagct tcgaagaact 


gcgtactttt 


ggcggtcacg acgctaaatt 


catcgactct 


960 


ctgcaagaaa acgagttccg 


tctgtactac 


tataacaagt tcaaagatat 


cgcatccacc 


1020 


ctgaacaaag cgaaatccat 


cgtgggtacc 


actgcttctc tccagtacat 


gaagaacgtt 


i r\ o r\ 
1080 


tttaaagaaa aatacctgct 


cagcgaagac 


acctccggca aattctctgt 


agacaagttg 


1 1 A (\ 


aaattcgata aactttacaa 


aatgctgact 


gaaatttaca ccgaagacaa 


■ cttcgttaag 


izUU 


ttctttaaag ttctgaaccg 


caaaacctat 


ctgaacttcg acaaggcagt 


attcaaaatc 


X^ou 


aacatcgtgc cgaaagttaa 


ctacactatc 


tacgatggtt tcaacctgcg 


taacaccaac 


1320 


ctggctgcta attttaacgg 


ccagaacacg 


gaaatcaaca acatgaactt 


cacaaaactg 


1380 


aaaaacttca ctggtctgtt 


cgagttttac 


aagctgctgt gcgtcgacgg 


catcattacc 


1440 


tccaaaacta aatctctgat 


agaaggtaga 


aacaaagcgc tgaacgacct 


ctgtatcaag 


1500 


gttaacaact gggatttatt 


cttcagcccg 


agtgaagaca acttcaccaa 


cgacctgaac 


1560 


aaaggtgaag aaatcacctc 


agatactaac 


atcgaagcag ccgaagaaaa 


catctcgctg 


1620 
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gacctgatcc 


agcagtacta 


cctgaccttt 


aatttcgaca 


acgagccgga 


aaacatttct 


1680 


atcgaaaacc 


tgagctctga 


tatcatcggc 


cagctggaac 


tgatgccgaa 


catcgaacgt 


1740 


ttcccaaacg 


gtaaaaagta 


cgagctggac 


aaatatacca 


tgttccacta 


cctgcgcgcg 


1800 


caggaatttg 


aacacggcaa 


atcccgtatc 


gcactgacta 


actccgttaa 


cgaagctctg 


1860 


ctcaacccgt 


cccgtgtata 


caccttcttc 


tctagcgact 


acgtgaaaaa 


ggtcaacaaa 


1920 


gcgactgaag 


ctgcaatgtt 


cttgggttgg 


gttgaacagc 


ttgtttatga 


ttttaccgac 


1980 


gagacgtccg 


aagtatctac 


taccgacaaa 


attgcggata 


tcactatcat 


catcccgtac 


2040 


atcggtccgg 


ctctgaacat 


tggcaacatg 


ctgtacaaag 


acgacttcgt 


tggcgcactg 


2100 


atcttctccg 


gtgcggtgat 


cctgctggag 


ttcatcccgg 


aaatcgccat 


cccggtactg 


2160 


ggcacctttg 


ctctggtttc 


ttacattgca 


aacaaggttc 


tgactgtaca 


aaccatcgac 


2220 


aacgcgctga 


gcaaacgtaa 


cgaaaaatgg 


gatgaagttt 


acaaatatat 


cgtgaccaac 


2280 


tggctggcta 


aggttaatac 


tcagatcgac 


ctcatccgca 


aaaaaatgaa 


agaagcactg 


2340 


gaaaaccagg 


cggaagctac 


caaggcaatc 


attaactacc 


agtacaacca 


gtacaccgag 


2400 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt 


cctctaaact 


gaacgaatcc 


2460 


atcaacaaag 


ctatgatcaa 


catcaacaag 


ttcctgaacc 


agtgctctgt 


aagctatctg 


2520 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg 


acttcgatgc 


gtctctgaaa 


2580 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc 


tgatcggtca 


ggttgatcgt 


2640 


ctgaaggaca 


aagtgaacaa 


taccttatcg 


accgacatcc 


cttttcagct 


cagtaaatat 


2700 


gtcgataacc 


aacgcctttt 


gtccactcta 


gactag 






2736 



<210> 16 
<211> 911 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 16 

Leu Gly He Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser 
15 10 15 

Ala Arg Lys Leu Ala Asn Gin Thr Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 



Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg Gly Ser Met Glu Phe Val 
35 40 45 
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Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly Val Asp lie Ala 
50 55 60 



Tyr He Lys He Pro Asn Ala Gly Gin Met Gin Pro Val Lys Ala Phe 
65 " 70 75 80 



Lys He His Asn Lys He Trp Val He Pro Glu Arg Asp Thr Phe Thr 

85 . .„ .90. 95 



Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala Lys Gin Val 
100 105 HO 



Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp Asn Glu Lys 
115 120 125 



Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu Arg lie Tyr Ser 
13 0 135 140 



Thr Asp Leu Gly Arg Met Leu Leu Thr Ser He Val Arg Gly He Pro 
145 " 150 155 160 



Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys Val He Asp Thr 
165 170 175 



Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tyr Arg Ser Glu Glu 
180 185 190 



Leu Asn Leu Val He He Gly Pro Ser Ala Asp lie lie Gin Phe Glu 
195 200 205 



Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr Arg Asn Gly Tyr 
210 215 220 



Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe Thr Phe Gly Phe 
225 230 235 240 



Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly Ala Gly Lys 
245 250 255 



Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu Leu He His Ala 
260 265 270 



Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn Arg Val Phe Lys 
275 280 285 



Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu Val Ser Phe 
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290 295 300 



Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe He Asp Ser 
305 310 315 320 



Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys Phe Lys Asp 
325 330 335 



He Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val Gly Thr Thr Ala 
340 345 350 



Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr Leu Leu Ser 
355 360 365 



Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys Phe Asp Lys 
370 375 380 



Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp Asn Phe Val Lys 
385 390 395 400 



Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe Asp Lys Ala 
405 410 415 



Val Phe Lys He Asn He Val Pro Lys Val Asn Tyr Thr He Tyr Asp 
420 425 430 



Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe Asn Gly Gin 
435 440 445 



Asn Thr Glu He Asn Asn Met Asn Phe Thr Lys Leu Lys Asn Phe Thr 
450 455 460 



Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly He He Thr 
465 470 475 480 



Ser Lys Thr Lys Ser Leu lie Glu Gly Arg Asn Lys Ala Leu Asn Asp 
485 490 495 



Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp 
515 520 525 



Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin 

''--^530 : - • " -535 - ,% • u.. - 540 ^ 
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Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn lie Ser 
545- 550 555 560 



lie Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 605 



Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys 
625 630 635 640 



Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala 
660 665 670 



Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He Gly 
675 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly 
690 695 700 



Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val 
725 730 735 



Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
770 775 780 
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Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 825 830 



Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met lie Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg 
865 870 875 880 



Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin 
885 890 895 



Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 



<210> 17 
<211> 2715 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 17 

ggatccgaat tcatgccgat caccatcaac aacttcaact acagcgatcc ggtggataac 60 
aaaaacatcc tgtacctgga tacccatctg aataccctgg cgaacgaacc ggaaaaagcg 120 
tttcgtatca ccggcaacat ttgggttatt ccggatcgtt ttagccgtaa cagcaacccg 180 
aatctgaata aaccgccgcg tgttaccagc ccgaaaagcg gttattacga tccgaactat 240 
ctgagcaccg atagcgataa agataccttc ctgaaagaaa tcatcaaact gttcaaacgc 300 
atcaacagcc gtgaaattgg cgaagaactg atctatcgcc tgagcaccga tattccgttt 3 60 
ccgggcaaca acaacacccc gatcaacacc tttgatttcg atgtggattt caacagcgtt 420 
gatgttaaaa cccgccaggg taacaattgg gtgaaaaccg gcagcattaa cccgagcgtg 480 
attattaccg gtccgcgcga aaacattatt gatccggaaa ccagcacctt taaactgacc 540 
aacaacacct ttgcggcgca ggaaggtttt ggcgcgctga gcattattag cattagcccg 600 
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cgctttatgc tgacctatag caacgcgacc aacgatgttg gtgaaggccg tttcagcaaa 660 

agcgaatttt gcatggaccc gatcctgatc ctgatgcatg aactgaacca tgcgatgcat 720 

aacctgtatg gcatcgcgat tccgaacgat cagaccatta gcagcgtgac cagcaacatc 780 
ttttacagcc agtacaacgt gaaactggaa tatgcggaaa tctatgcgtt tggcggtccg 840 
accattgatc tgattccgaa aagcgcgcgc aaatacttcg aagaaaaagc gctggattac 900 
tatcgcagca ttgcgaaacg tctgaacagc attaccaccg cgaatccgag cagcttcaac 960 

aaatatatcg gcgaatataa acagaaactg atccgcaaat atcgctttgt ggtggaaagc 1020 

agcggcgaag ttaccgttaa ccgcaataaa ttcgtggaac tgtacaacga actgacccag 1080 

atcttcaccg aatttaacta tgcgaaaatc tataacgtgc agaaccgtaa aatctacctg 1140 

agcaacgtgt ataccccggt gaccgcgaat attctggatg ataacgtgta cgatatccag 1200 

aacggcttta acatcccgaa aagcaacctg aacgttctgt ttatgggcca gaacctgagc 1260 

cgtaatccgg cgctgcgtaa agtgaacccg gaaaacatgc tgtacctgtt caccaaattt 1320 

tgcgtcgacg cgatagatgg tagatttggc ggtttcacgg gcgcacgcaa atcagcgcgt 1380 

aaattagcta accaggcgct agcgggcggt ggcggtagcg gcggtggcgg tagcggcggt 1440 

ggcggtagcg cactagtgct gcagtgtcgt gaactgctgg tgaaaaacac cgatctgccg 1500 

tttattggcg atatcagcga tgtgaaaacc gatatcttcc tgcgcaaaga tatcaacgaa 1560 

gaaaccgaag tgatctacta cccggataac gtgagcgttg atcaggtgat cctgagcaaa 1620 

aacaccagcg aacatggtca gctggatctg ctgtatccga gcattgatag cgaaagcgaa 1680 

attctgccgg gcgaaaacca ggtgttttac gataaccgta cccagaacgt ggattacctg 1740 

aacagctatt actacctgga aagccagaaa ctgagcgata acgtggaaga ttttaccttt 1800 

acccgcagca ttgaagaagc gctggataac agcgcgaaag tttacaccta ttttccgacc 1860 

ctggcgaaca aagttaatgc gggtgttcag ggcggtctgt ttctgatgtg ggcgaacgat 1920 

gtggtggaag atttcaccac caacatcctg cgtaaagata ccctggataa aatcagcgat 1980 

gttagcgcga ttattccgta tattggtccg gcgctgaaca ttagcaatag cgtgcgtcgt 2040 

ggcaatttta ccgaagcgtt tgcggttacc ggtgtgacca ttctgctgga agcgtttccg 2100 

gaatttacca ttccggcgct gggtgcgttt gtgatctata gcaaagtgca ggaacgcaac 2160 ' 

gaaatcatca aaaccatcga taactgcctg gaacagcgta ttaaacgctg gaaagatagc 2220 

tatgaatgga tgatgggcac ctggctgagc cgtattatca cccagttcaa caacatcagc 2280 

taccagatgt acgatagcct gaactatcag gcgggtgcga ttaaagcgaa aatcgatctg 2340 

gaatacaaaa aatacagcgg cagcgataaa gaaaacatca aaagccaggt tgaaaacctg 2400 
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aaaaacagcc tggatgtgaa aattagcgaa gcgatgaata acatcaacaa attcatccgc 2460 

gaatgcagcg tgacctacct gttcaaaaac atgctgccga aagtgatcga tgaactgaac 2520 

gaatttgatc gcaacaccaa agcgaaactg atcaacctga tcgatagcca caacattatt 2580 

ctggtgggcg aagtggataa actgaaagcg aaagttaaca acagcttcca gaacaccatc 2640 

ccgtttaaca tcttcagcta taccaacaac agcctgctga aagatatcat caacgaatac 2700 

ttcaatctag actag 2715 



<210> 18 

<211> 904 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 18 

Gly Ser Glu Phe Met Pro He Thr He Asn Asn Phe Asn Tyr Ser Asp 
x 5 10 15 



Pro Val Asp Asn Lys Asn lie Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp 
35 40 45 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 " 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 
100 105 HO 



Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
130 135 140 



Arg ..Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val 
145" v 150- >• 155' 160 
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He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 *" 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 



Thr Ser Asn He Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He 
290 295 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr He Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin 
385 390 395 400 



WO 2006/059093 



23 



PCT/GB2005/004585 



Asn Gly Phe Asn lie Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 430 



Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Ala He Asp Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys Arg Glu Leu Leu Val Lys Asn 
485 490 495 



Thr Asp Leu Pro Phe He Gly Asp He Ser Asp Val Lys Thr Asp He 
500 505 510 



Phe Leu Arg Lys Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro 
515 520 525 



Asp Asn Val Ser Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu 
530 535 540 



His Gly Gin Leu Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu 
545 550 555 560 



He Leu Pro Gly Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn 
565 570 575 



Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser 
580 585 590 



Asp Asn Val Glu Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu 
595 600 605 



Asp Asn Ser Ala Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys 
610 615 620 



Val Asn Ala Gly Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp 
625 630 635 640 
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Val Val Glu Asp Phe Thr Thr Asn lie Leu Arg Lys Asp Thr Leu Asp 
645 650 655 



Lys lie Ser Asp Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu 
660 665 670 



Asn He Ser Asn Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala 
675 680 685 



Val Thr Gly Val Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He 
690 695 700 



Pro Ala Leu Gly Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn 
705 710 715 720 



Glu He He Lys Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg 
725 730 735 



Trp Lys Asp Ser Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He 
740 745 750 



He Thr Gin Phe Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn 
755 760 765 



Tyr Gin Ala Gly Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys 
770 775 780 



Tyr Ser Gly Ser Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu 
785 790 795 800 



Lys Asn Ser Leu Asp Val Lys He Ser Glu Ala Met Asn Asn He Asn 
805 810 815 



Lys Phe He Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu 
820 825 830 



Pro Lys Val He Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala 
835 840 845 



Lys Leu He Asn Leu He Asp Ser His Asn He He Leu Val Gly Glu 
850 855 860 



Val Asp Lys Leu Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr He 
865 " 870 875 880 



Pro Phe Asn He Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp He 
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885 890 895 

lie Asn Glu Tyr Phe Asn Leu Asp 
900 

<210> 19 
<211> 2742 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 19 



ggatccgaat 


tcatgccgat 


caccatcaac 


aacttcaact acagcgatcc 


ggtggataac 


60 


aaaaacatcc 


tgtacctgga 


tacccatctg 


aataccctgg cgaacgaacc 


ggaaaaagcg 


120 


tttcgtatca 


ccggcaacat 


ttgggttatt 


ccggatcgtt ttagccgtaa 


cagcaacccg 


180 


aatctgaata 


aaccgccgcg 


tgttaccagc 


ccgaaaagcg gttattacga 


tccgaactat 


240 


ctgagcaccg 


atagcgataa 


agataccttc 


ctgaaagaaa tcatcaaact 


gttcaaacgc 


300 


atcaacagcc 


gtgaaattgg 


cgaagaactg 


atctatcgcc tgagcaccga 


tattccgttt 


360 


ccgggcaaca 


acaacacccc 


gatcaacacc 


tttgatttcg atgtggattt 


caacagcgtt 


420 


gatgttaaaa 


cccgccaggg 


taacaattgg 


gtgaaaaccg gcagcattaa 


cccgagcgtg 


480 


attattaccg 


gtccgcgcga 


aaacattatt 


gatccggaaa ccagcacctt 


taaactgacc 


540 


aacaacacct 


ttgcggcgca 


ggaaggtttt 


ggcgcgctga gcattattag 


cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc 


aacgatgttg gtgaaggccg 


tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc 


ctgatgcatg aactgaacca 


tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat 


cagaccatta gcagcgtgac 


cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa 


tatgcggaaa tctatgcgtt 


tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc 


aaatacttcg aagaaaaagc 


gctggattac 


900 


tatcgcagca 


ttgcgaaacg 


tctgaacagc 


attaccaccg cgaatccgag 


cagcttcaac 


960 


aaatatatcg 


gcgaatataa 


acagaaactg 


atccgcaaat atcgctttgt 


ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa 


ttcgtggaac tgtacaacga 


actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc 


tataacgtgc agaaccgtaa 


aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat 


attctggatg ataacgtgta 


cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg 


aacgttctgt ttatgggcca 


gaacctgagc 


1260 


cgtaatccgg 


cgctgcgtaa 


agtgaacccg 


gaaaacatgc tgtacctgtt 


caccaaattt 


1320 


, tgcgtcgacg 


gcafcoattAQ. 


c^cca^aact- 


aaatctctga ctagaaggtag 


atttggcggt 


1380 
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ttcacgggcg 


cacgcaaatc 


agcgcgtaaa 


ttagctaacc 


aggcgctagc gggcggtggc 


1440 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagcgcac 


tagtgctgca gtgtcgtgaa 


1500 


ctgctggtga 


aaaacaccga 


tctgccgttt 


attggcgata 


tcagcgatgt gaaaaccgat 


1560 


atcttcctgc 


gcaaagatat 


caacgaagaa 


accgaagtga 


tctactaccc ggataacgtg 


1620 


agcgttgatc 


aggtgatcct 


gagcaaaaac 


accagcgaac 


atggtcagct ggatctgctg 


1680 


tatccgagca 


ttgatagcga 


aagcgaaatt 


ctgccgggcg 


aaaaccaggt gttttacgat 


1740 


aaccgtaccc 


agaacgtgga 


ttacctgaac 


agctattact 


acctggaaag ccagaaactg 


1800 


agcgataacg 


tggaagattt 


tacctttacc 


cgcagcattg 


aagaagcgct ggataacagc 


1860 


gcgaaagttt 


acacctattt 


tccgaccctg 


gcgaacaaag 


ttaatgcggg tgttcagggc 


1920 


ggtctgtttc 


tgatgtgggc 


gaacgatgtg 


gtggaagatt 


tcaccaccaa catcctgcgt 


1980 


aaagataccc 


tggataaaat 


cagcgatgtt 


agcgcgatta 


ttccgtatat tggtccggcg 


2040 


ctgaacatta 


gcaatagcgt 


gcgtcgtggc 


aattttaccg 


aagcgtttgc ggttaccggt 


2100 


gtgaccattc 


tgctggaagc 


gtttccggaa 


tttaccattc 


cggcgctggg tgcgtttgtg 


2160 


atctatagca 


aagtgcagga 


acgcaacgaa 


atcatcaaaa 


ccatcgataa ctgcctggaa 


2220 


cagcgtatta 


aacgctggaa 


agatagctat 


gaatggatga 


tgggcacctg gctgagccgt 


2280 


attatcaccc 


agttcaacaa 


catcagctac 


cagatgtacg 


atagcctgaa ctatcaggcg 


2340 


ggtgcgatta 


aagcgaaaat 


cgatctggaa 


tacaaaaaat 


acagcggcag cgataaagaa 


2400 


aacatcaaaa 


gccaggttga 


aaacctgaaa 


aacagcctgg 


atgtgaaaat tagcgaagcg 


2460 


atgaataaca 


tcaacaaatt 


catccgcgaa 


tgcagcgtga 


cctacctgtt caaaaacatg 


2520 


ctgccgaaag 


tgatcgatga 


actgaacgaa 


tttgatcgca 


acaccaaagc gaaactgatc 


2580 


aacctgatcg 


atagccacaa 


cattattctg 


gtgggcgaag 


tggataaact gaaagcgaaa 


2640 


gttaacaaca 


gcttccagaa 


caccatcccg 


tttaacatct 


tcagctatac caacaacagc 


2700 


ctgctgaaag 


atatcatcaa 


cgaatacttc 


aatctagact 


ag 


2742 



<210> 20 

<211> 913 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 20 



Gly Ser Glu Phe Met Pro lie Thr lie Asn Asn Phe Asn Tyr Ser Asp 
1 5 10 15 
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Pro Val Asp Asn Lys Asn lie Leu Tyr Leu "Asp THr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp 
35 40 45 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 " 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 
100 105 110 



Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
130 135 140 



Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val 
145 150 155 160 



He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 ** 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 
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Thr Ser Asn lie Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 

260 "~ ~~"~~265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp .Tyr Tyr Arg Ser He 
290 . 2S5 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr He Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin 
385 390 395 400 



Asn Gly Phe Asn He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 430 



Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Gly He He Thr Ser 
435 440 445 



Lys. Thr Lys Ser Leu He Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala 
450 455 460 



Arg Lys Ser Ala Arg Lys Leu Ala Asn Gin Ala Leu Ala Gly Gly Gly 
465 470 475 480 



Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu 
485 490 495 



Gin Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe He Gly 
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500 505 510 



Asp lie Ser Asp Val Lys Thr Asp lie Phe Leu Arg Lys Asp He Asn 
515 520 525 



Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser Val Asp Gin 
530 535 540 



Val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu Asp Leu Leu 
545 550 555 560 



Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly Glu Asn Gin 
565 570 575 



Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu Asn Ser Tyr 
580 585 590 



Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu Asp Phe Thr 
595 600 605 



Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala Lys Val Tyr 
610 615 620 



Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val Gin Gly 
625 630 635 640 



Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp Phe Thr Thr 
645 650 655 



Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp Val Ser Ala 
660 665 670 



He He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn Ser Val Arg 
675 680 685 



Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val Thr He Leu 
690 695 700 



Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly Ala Phe Val 
705 710 715 720 



He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys Thr He Asp 
725 730 735 



Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser Tyr Glu Trp 

• — 740 r ~ - \ ::>K r •* "y 745 ; * * ; * 750 
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Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe Asn Asn He 
755 760 765 



Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly Ala He Lys 
770 775 780 



Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu 
785 790 795 800 



Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys 
805 810 815 



He Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser 
820 825 830 



Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He Asp Glu Leu 
835 840 845 



Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu He Asn Leu He Asp 
850 855 860 



Ser His Asn He He Leu Val Gly Glu Val Asp Lys Leu Lys Ala Lys 
865 870 875 880 



Val Asn Asn Ser Phe Gin Asn Thr He Pro Phe Asn He Phe Ser Tyr 
885 890 895 



Thr Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr Phe Asn Leu 
900 905 910 



Asp 



<210> 


21 


<211> 


2673 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


21 



ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 



cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 
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ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 3 60 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatacggt ggtttcatgg cgctagcggg cggtggcggt 1380 

agcggcggtg gcggtagcgg cggtggcggt agcgcactag tgctgcagtg tatcaaggtt 1440 

aacaactggg atttattctt cagcccgagt gaagacaact tcaccaacga cctgaacaaa 1500 

ggtgaagaaa tcacctcaga tactaacatc gaagcagccg aagaaaacat ctcgctggac 1560 

ctgatccagc agtactacct gacctttaat ttcgacaacg agccggaaaa catttctatc 1620 

gaaaacctga gctctgatat catcggccag ctggaactga tgccgaacat cgaacgtttc 1680 

ccaaacggta aaaagtacga gctggacaaa tataccatgt tccactacct gcgcgcgcag 1740 

gaatttgaac acggcaaatc ccgtatcgca ctgactaact ccgttaacga agctctgctc 1800 

aacccgtccc gtgtatacac cttcttctct agcgactacg tgaaaaaggt caacaaagcg 1860 

actgaagctg caatgttctt gggttgggtt gaacagcttg tttatgattt taccgacgag 1920 

acgtccgaag tatctactac cgacaaaatt gcggatatca ctatcatcat cccgtacatc 1980 

ggtccggctc tgaacattgg caacatgctg tacaaagacg acttcgttgg cgcactgatc 2040 
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ttctccggtg cggtgatcct gctggagttc atcccggaaa tcgccatccc ggtactgggc 2100 

acctttgctc tggtttctta cattgcaaac aaggttctga ctgtacaaac catcgacaac 2160 

gcgctgagca aacgtaacga aaaatgggat gaagtttaca aatatatcgt gaccaactgg 2220 

ctggctaagg ttaatactca gatcgacctc atccgcaaaa aaatgaaaga agcactggaa 2280 

aaccaggcgg aagctaccaa ggcaatcatt aactaccagt acaaccagta caccgaggaa 2340 

gaaaaaaaca acatcaactt caacatcgac gatctgtcct ctaaactgaa cgaatccatc 2400 

aacaaagcta tgatcaacat caacaagttc ctgaaccagt gctctgtaag ctatctgatg 2460 

aactccatga tcccgtacgg tgttaaacgt ctggaggact tcgatgcgtc tctgaaagac 2520 

gccctgctga aatacattta cgacaaccgt ggcactctga tcggtcaggt tgatcgtctg 2580 

aaggacaaag tgaacaatac cttatcgacc gacatccctt ttcagctcag taaatatgtc 2640 

gataaccaac gccttttgtc cactctagac tag 2673 



<210> 22 
<211> 890 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 22 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 HO 
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He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr.Arg_.Ser.-Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 ' 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



WO 2006/059093 Best Available Copy 34 



PCT/GB2005/004585 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 — 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Met Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly 
450 455 460 



Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val 
465 470 475 480 



Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn 
485 490 495 



Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala 
500 505 510 



Ala Glu Glu Asn He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr 
515 520 525 



Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser 
530 " 535 540 



Ser. Asp He He Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe 
545 550 555 560 



Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr 
565 570 575 



Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr 
580 585 590 



Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe 
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595 600 605 



Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala 
610 615 620 



Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu 
625 630 635 640 



Thr Ser Glu Val Ser Thr Thr Asp Lys lie Ala Asp He Thr He He 
645 650 655 



He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys 
660 665 670 



Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu 
675 680 685 



Glu Phe lie Pro Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu 
690 695 700 



Val Ser Tyr He Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn 
705 710 715 720 



Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He 
725 730 735 



Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg 
740 745 750 



Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala 
755 760 765 



He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn 
770 775 780 



He Asn Phe Asn He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He 
785 790 795 800 



Asn Lys Ala Met He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val 
805 810 815 



Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu 
820 825 830 



Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp 

835 * ■ 840 .\.y. - ' " :> .B45 



WO 2006/059093 Best Available Copy 36 



PCT/GB2005/004585 



Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val 
850 855 860 

Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val 
865 870 875 880 

Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 

<210> 23 
<211> 2751 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 23 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg tatgctgctg 


actagcatcg ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg ttccgaagaa 


ctgaacctgg tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactdagta 


catccgtttc 


tctccggact tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg cagtattcaa 


aatcaacatc 


1140 
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gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatcgaagg 


tcgttacggt 


ggtttcatga 


cctctgaaaa 


atctcagacc 


1380 


ccgctggtta 


ccctgttcaa 


aaacgctatc 


atcaaaaacg 


cttacaaaaa 


aggtgaagcg 


1440 


ctagcgggtg 


gtggtggttc 


tggtggtggt 


ggttctggtg 


gtggtggttc 


tgcactagtg 


1500 


ctgcagtgta 


tcaaggttaa 


caactgggat 


ttattcttca 


gcccgagtga 


agacaacttc 


1560 


accaacgacc 


tgaacaaagg 


tgaagaaatc 


acctcagata 


ctaacatcga 


agcagccgaa 


1620 


gaaaacatct 


cgctggacct 


gatccagcag 


tactacctga 


cctttaattt 


cgacaacgag 


1680 


ccggaaaaca 


tttctatcga 


aaacctgagc 


tctgatatca 


tcggccagct 


ggaactgatg 


1740 


ccgaacatcg 


aacgtttccc 


aaacggtaaa 


aagtacgagc 


tggacaaata 


taccatgttc 


1800 


cactacctgc 


gcgcgcagga 


atttgaacac 


ggcaaatccc 


gtatcgcact 


gactaactcc 


1860 


gttaacgaag 


ctctgctcaa 


cccgtcccgt 


gtatacacct 


tcttctctag 


cgactacgtg 


1920 


aaaaaggtca 


acaaagcgac 


tgaagctgca 


atgttcttgg 


gttgggttga 


acagcttgtt 


1980 


tatgatttta 


ccgacgagac 


gtccgaagta 


tctactaccg 


acaaaattgc 


ggatatcact 


2040 


atcatcatcc 


cgtacatcgg 


tccggctctg 


aacattggca 


acatgctgta 


caaagacgac 


2100 


ttcgttggcg 


cactgatctt 


ctccggtgcg 


gtgatcctgc 


tggagttcat 


cccggaaatc 


2160 


gccatcccgg 


tactgggcac 


ctttgctctg 


gtttcttaca 


ttgcaaacaa 


ggttctgact 


2220 


gtacaaacca 


tcgacaacgc 


gctgagcaaa 


cgtaacgaaa 


aatgggatga 


agtttacaaa 


2280 


tatatcgtga 


ccaactggct 


ggctaaggtt 


aatactcaga 


tcgacctcat 


ccgcaaaaaa 


2340 


atgaaagaag 


cactggaaaa 


ccaggcggaa 


gctaccaagg 


caatcattaa 


ctaccagtac 


2400 


aaccagtaca 


ccgaggaaga 


aaaaaacaac 


atcaacttca 


acatcgacga 


tctgtcctct 


2460 


aaactgaacg 


aatccatcaa 


caaagctatg 


atcaacatca 


acaagttcct 


gaaccagtgc 


2520 


tctgtaagct 


atctgatgaa 


ctccatgatc 


ccgtacggtg 


ttaaacgtct 


ggaggacttc 


2580 


gatgcgtctc 


tgaaagacgc 


cctgctgaaa 


tacatttacg 


acaaccgtgg 


cactctgatc 


2640 


ggtcaggttg 


atcgtctgaa 


ggacaaagtg 


aacaatacct 


tatcgaccga 


catccctttt 


2700 


cagctcagta 


aatatgtcga 


taaccaacgc 


cttttgtcca 


ctctagacta 


g 


2751 



<210> 24 

<211> 916 

<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Synthetic 
<400> 24 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 " 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 • 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
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225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 * 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 "* 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Met Thr Ser Glu Lys Ser Gin Thr Pro Leu Val Thr 
450 455 460 



Leu Phe Lys Asn Ala He He Lys Asn Ala Tyr Lys Lys Gly Glu Ala 

i465~ - 470 • r 4=75 - 480 : 
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Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
485 490 495 



Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe 
500 505 510 



Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu 
515 520 525 



Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser 
530 535 540 



Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu 
545 550 555 560 



Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin 
565 570 575 



Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr 
580 585 590 



Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe 
595 600 605 



Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala 
610 615 620 



Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val 
625 630 635 640 



Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val 
645 650 655 



Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr 
660 665 670 



Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He Gly Pro 
675 680 685 



Ala Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala 
690 695 700 



Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He 
705 710 715 720 
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Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn 
725 730 735 



Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn 
740 745 750 



Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala 
755 760 765 



Lys Val Asn Thr Gin lie Asp Leu He Arg Lys Lys Met Lys Glu Ala 
770 775 780 



Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr 
785 790 795 800 



Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp 
805 810 815 



Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn 
820 825 830 



He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser 
835 840 845 



Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu 
850 855 860 



Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He 
865 870 875 .880 



Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr 
885 890 895 



Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu 
900 905 910 



Ser Thr Leu Asp 
915 



<210> 25 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
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<400> 25 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 
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atcgcactga 


ctaactccgt 


taacgaagct ctgctcaacc cgtcccgtgt atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac aaagcgactg aagctgcaat gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc gacgagacgt ccgaagtatc tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg tacatcggtc cggctctgaa cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca ctgatcttct ccggtgcggt gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta ctgggcacct ttgctctggt ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc gacaacgcgc tgagcaaacg taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc aactggctgg ctaaggttaa tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca ctggaaaacc aggcggaagc taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc gaggaagaaa aaaacaacat caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa tccatcaaca aagctatgat caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat ctgatgaact ccatgatccc gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg aaagacgccc tgctgaaata catttacgac 


2580 


aaccataaca 


ctctcratcacr 


fccaciafcfccrat ccrtctaaacrcr acaaaataaa caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa tatgtcgata accaacgcct tttgtccact 


2700 


ctagactag 






2709 



<210> 26 

<211> 902 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 26 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 -10 15 

Asn Gly Val Asp lie Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 " 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly lie Ala lie Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 ' 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 ~ 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 " 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 ~ 560 
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Gly Gin Leu Glu Leu Met Pro Asn lie Glu Arg Phe Pro Asn Gly Lys 
- 565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu. Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
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805 810 815 

He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 

Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 

Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 

Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 

Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 ** 895 

Leu Leu Ser Thr Leu Asp 
900 

<210> 27 
<211> 2691 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400>. 27 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 
cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 
ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 
tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 3 00 
tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 
ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 
cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 
gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 
ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 
tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 
gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 
■SP9aaccatg--ti?^<?a^.ag&-Jt-aajpa^c»8rff^cgtat-taffg agatgtccgg tctggaagtt 780 
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agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcggcg 


1380 


ctagcgggcg gtggcggtag 


cggcggtggc 


ggtagcggcg 


gtggcggtag 


cgcactagtg 


1440 


ctgcagtgta 


tcaaggttaa 


caactgggat 


ttattcttca 


gcccgagtga 


agacaacttc 


1500 


accaacgacc 


tgaacaaagg 


tgaagaaatc 


acctcagata 


ctaacatcga 


agcagccgaa 


1560 


gaaaacatct 


cgctggacct 


gatccagcag 


tactacctga 


cctttaattt 


cgacaacgag 


1620 


ccggaaaaca 


tttctatcga 


aaacctgagc 


tctgatatca 


tcggccagct 


ggaactgatg 


1680 


ccgaacatcg 


aacgtttccc 


aaacggtaaa 


aagtacgagc 


tggacaaata 


taccatgttc 


1740 


cactacctgc 


gcgcgcagga 


atttgaacac 


ggcaaatccc 


gtatcgcact 


gactaactcc 


1800 


gttaacgaag 


ctctgctcaa 


cccgtcccgt 


gtatacacct 


tcttctctag 


cgactacgtg 


1860 


aaaaaggtca 


acaaagcgac 


tgaagctgca 


atgttcttgg 


gttgggttga 


acagcttgtt 


1920 


tatgatttta 


ccgacgagac 


gtccgaagta 


tctactaccg 


acaaaattgc 


ggatatcact 


1980 


atcatcatcc 


cgtacatcgg 


tccggctctg 


aacattggca 


acatgctgta 


caaagacgac 


2040 


ttcgttggcg 


cactgatctt 


ctccggtgcg 


gtgatcctgc 


tggagttcat 


cccggaaatc 


2100 


gccatcccgg 


tactgggcac 


ctttgctctg 


gtttcttaca 


ttgcaaacaa 


ggttctgact 


2160 


gtacaaacca 


tcgacaacgc 


gctgagcaaa 


cgtaacgaaa 


aatgggatga 


agtttacaaa 


2220 


tatatcgtga ccaactggct 


ggctaaggtt 


aatactcaga 


tcgacctcat 


ccgcaaaaaa 


2280 


atgaaagaag cactggaaaa 


ccaggcggaa 


gctaccaagg 


caatcattaa 


ctaccagtac 


2340 


aaccagtaca 


ccgaggaaga 


aaaaaacaac 


atcaacttca 


acatcgacga 


tctgtcctct 


2400 


aaactgaacg 


aatccatcaa 


caaagctatg 


atcaacatca 


acaagttcct 


gaaccagtgc 


2460 


tctgtaagct atctgatgaa 


ctccatgatc 


ccgtacggtg 


ttaaacgtct 


ggaggacttc 


2520 


gatgcgtctc tgaaagacgc 


cctgctgaaa 

fe-\i<a. Z- r.'. 


tacatttacg 


acaaccgtgg 


cactctgatc 


2580 
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ggtcaggttg atcgtctgaa ggacaaagtg aacaatacct tatcgaccga catccctttt 2640 
cagctcagta aatatgtcga taaccaacgc cttttgtcca ctctagacta g 2 691 



<210> 28 

<211> 896 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 28 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 

*■■'*' *180 ^ " • • ' "185 - 190 
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Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 " 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 ~ 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 ' 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 
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Val Asp Gly lie He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Ala Leu Ala Gly Gly 
450 455 460 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val 
465 470 475 480 



Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser 
485 490 495 



Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser 
500 505 510 



Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He 
515 520 525 



Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He 
530 ^ 535 540 



Ser He Glu Asn Leu Ser Ser Asp lie lie Gly Gin Leu Glu Leu Met 
545 550 555 560 



Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys 
565 570 575 



Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys 
580 585 590 



Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
595 600 605 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn 
610 615 620 



Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val 
625 630 635 640 



Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He 
645 650 655 



Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He 
660 665 670 
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Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie Phe Ser 
675 680 685 



Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val 
690 695 700 



Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr 
705 710 715 " 720 



Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp 
725 730 " " 735 



Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr 
740 745 750 



Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin 
755 760 765 



Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr 
770 775 780 



Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser 
785 790 795 800 



Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe 
805 810 815 



Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr 
820 825 830 



Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
835 840 845 



Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp 
850 855 860 



Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe 
865 870 875 880 



Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 895 



<210> 29 

<211> 2691 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic 
<400> 29 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atatgcggcg 


1380 


ctagcgggcg 


gtggcggtag 


cggcggtggc 


ggtagcggcg 


gtggcggtag 


cgcactagtg 


1440 


ctgcagtgta 


tcaaggttaa 


caactgggat 


ttattcttca 


gcccgagtga 


agacaacttc 


1500 


accaacgacc 


tgaacaaagg 


tgaagaaatc 


acctcagata 


ctaacatcga 


agcagccgaa 


1560 


gaaaacatct 


cgctggacct 


gatccagcag 


tactacctga 


cctttaattt 


cgacaacgag 


1620 


ccggaaaaca 


tttctatcga 


aaacctgagc 


tctgatatca 


tcggccagct 


ggaactgatg 


1680 


ccgaacatcg 


aacgtttccc 


aaacggtaaa 


aagtacgagc 


tggacaaata 


taccatgtte 


1740 
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cactacctgc 


gcgcgcagga 


atttgaacac 


ggcaaatccc 


gtatcgcact 


gactaactcc 


1800 


gttaacgaag 


ctctgctcaa 


cccgtcccgt 


gtatacacct 


tcttctctag 


cgactacgtg 


1860 


aaaaaggtca 


acaaagcgac 


tgaagctgca 


atgttcttgg 


gttgggttga 


acagcttgtt 


1920 


tatgatttta 


ccgacgagac 


gtccgaagta 


tctactaccg 


acaaaattgc 


ggatatcact 


1980 


atcatcatcc 


cgtacatcgg 


tccggctctg 


aacattggca 


acatgctgta 


caaagacgac 


2040 


ttcgttggcg 


cactgatctt 


ctccggtgcg 


gtgatcctgc 


tggagttcat 


cccggaaatc 


2100 


gccatcccgg 


tactgggcac 


ctttgctctg 


gtttcttaca 


ttgcaaacaa 


ggttctgact 


2160 


gtacaaacca 


tcgacaacgc 


gctgagcaaa 


cgtaacgaaa 


aatgggatga 


agtttacaaa 


2220 


tatatcgtga 


ccaactggct 


ggctaaggtt 


aatactcaga 


tcgacctcat 


ccgcaaaaaa 


2280 


atgaaagaag 


cactggaaaa 


ccaggcggaa 


gctaccaagg 


caatcattaa 


ctaccagtac 


2340 


aaccagtaca 


ccgaggaaga 


aaaaaacaac 


atcaacttca 


acatcgacga 


tctgtcctct 


2400 


aaactgaacg 


aatccatcaa 


caaagctatg 


atcaacatca 


acaagttcct 


gaaccagtgc 


2460 


tctgtaagct 


atctgatgaa 


ctccatgatc 


ccgtacggtg 


ttaaacgtct 


ggaggacttc 


2520 


gatgcgtctc 


tgaaagacgc 


cctgctgaaa 


tacatttacg 


acaaccgtgg 


cactctgatc 


2580 


ggtcaggttg 


atcgtctgaa 


ggacaaagtg 


aacaatacct 


tatcgaccga 


catccctttt 


2640 


cagctcagta 


aatatgtcga 


taaccaacgc 


cttttgtcca 


ctctagacta 


g 


2691 



<210> 30 

<211> 896 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 30 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 

Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 

Pro- Glu" J£lc^-Lys' Gin 'Val" Pro Val Ser Tyr Tyr Asp Ser" Thr Tyr Leu 
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65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 

305t,^..:r - 31C ~ « -" -315 - 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Tyr Ala Ala Leu Ala Gly Gly 
450 455 460 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val 
465 470 475 480 



Leu Gin Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser 
485 490 495 



Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie Thr Ser 
500 505 510 



Asp Thr Asn lie Glu Ala Ala Glu Glu Asn lie Ser Leu Asp Leu lie 
515 520 525 



Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn lie 
530 ~ 535 540 



Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met 
545 550 . - .555, 560 
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Pro Asn lie Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys 
565 570 575 



Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys 
580 585 590 



Ser Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
595 600 605 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn 
610 615 620 



Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val 
625 630 635 640 



Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys lie 
645 650 655 



Ala Asp lie Thr lie lie lie Pro Tyr lie Gly Pro Ala Leu Asn lie 
660 665 670 



Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie Phe Ser 
675 680 685 



Gly Ala Val lie Leu Leu Glu Phe lie Pro Glu lie Ala lie Pro Val 
690 695 700 



Leu Gly Thr Phe Ala Leu Val Ser Tyr lie Ala Asn Lys Val Leu Thr 
705 710 715 720 



Val Gin Thr lie Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp 
725 730 735 



Glu Val Tyr Lys Tyr lie Val Thr Asn Trp Leu Ala Lys Val Asn Thr 
740 745 750 



Gin lie Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin 
755 760 765 



Ala, Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr 
770 775 780 



Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser 
785 790 795 800 
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Lys Leu Asn Glu Ser lie Asn Lys Ala Met lie Asn lie Asn Lys Phe 
805 810 815 

Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met lie Pro Tyr 
820 825 830 

Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
835 840 . 845 

Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr Leu lie Gly Gin Val Asp 
850 855 860 

Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp lie Pro Phe 
865 870 875 880 

Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 



<210> 31 
<211> 2691 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 31 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 

attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 

cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 18 0 

ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 
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gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa atcatatgcg 1380 

ctagcgggcg gtggcggtag cggcggtggc ggtagcggcg gtggcggtag cgcactagtg 144 0 

ctgcagtgta tcaaggttaa caactgggat ttattcttca gcccgagtga agacaacttc 1500 

accaacgacc tgaacaaagg tgaagaaatc acctcagata ctaacatcga agcagccgaa 1560 

gaaaacatct cgctggacct gatccagcag tactacctga cctttaattt cgacaacgag 1620 

ccggaaaaca tttctatcga aaacctgagc tctgatatca tcggccagct ggaactgatg 1680 

ccgaacatcg aacgtttccc aaacggtaaa aagtacgagc tggacaaata taccatgttc 1740 

cactacctgc gcgcgcagga atttgaacac ggcaaatccc gtatcgcact gactaactcc 1800 

gttaacgaag ctctgctcaa cccgtcccgt gtatacacct tcttctctag cgactacgtg 1860 

aaaaaggtca acaaagcgac tgaagctgca atgttcttgg gttgggttga acagcttgtt 1920 

tatgatttta ccgacgagac gtccgaagta tctactaccg acaaaattgc ggatatcact 1980 

atcatcatcc cgtacatcgg tccggctctg aacattggca acatgctgta caaagacgac 2040 

ttcgttggcg cactgatctt ctccggtgcg gtgatcctgc tggagttcat cccggaaatc 2100 

gccatcccgg tactgggcac ctttgctctg gtttcttaca ttgcaaacaa ggttctgact 2160 

gtacaaacca tcgacaacgc gctgagcaaa cgtaacgaaa aatgggatga agtttacaaa 2220 

tatatcgtga ccaactggct ggctaaggtt aatactcaga tcgacctcat ccgcaaaaaa 2280 

atgaaagaag cactggaaaa ccaggcggaa gctaccaagg caatcattaa ctaccagtac 2340 

aaccagtaca ccgaggaaga aaaaaacaac atcaacttca acatcgacga tctgtcctct 2400 

aaactgaacg aatccatcaa caaagctatg atcaacatca acaagttcct gaaccagtgc 2460 

tctgtaagct atctgatgaa ctccatgatc ccgtacggtg ttaaacgtct ggaggacttc 2520 

gatgcgtctc tgaaagacgc cctgctgaaa tacatttacg acaaccgtgg cactctgatc 2580 

ggtcaggttg atcgtctgaa ggacaaagtg aacaatacct tatcgaccga catccctttt 2640 

cagctcagta aatatgtcga taaccaacgc cttttgtcca ctctagacta g 2691 
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<210> 


32 


<211> 


896 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic 


<400> 


32 


Gly Ser Met Glu Phe Val Asn 


1 


5 



10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp- Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr "Aifr Pro 
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195 



200 



205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe. Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 

'^435" ' 440 * ~" ^ 445- 
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Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Tyr Ala Leu Ala Gly Gly 
450 455 460 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val 
465 470 475 480 



Leu Gin Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser 
485 490 495 



Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie Thr Ser 
500 505 510 



Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He 
515 520 525 



Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He 
530 535 540 



Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met 
545 550 555 560 



Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys 
565 570 575 



Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys 
580 585 590 



Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
595 600 605 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys, Lys Val Asn 
610 615 620 



Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val 
625 630 635 640 



Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He 
645 650 655 



Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He 
660 665 670 



Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser 

675 . .. 680 685 
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Gly Ala Val lie Leu Leu Glu Phe lie Pro Glu lie Ala lie Pro Val 
690 695 700 



Leu Gly Thr Phe Ala Leu Val Ser Tyr lie Ala Asn Lys Val Leu Thr 
705 710 715 720 



Val Gin Thr lie Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp 
725 730 735 



Glu Val Tyr Lys Tyr lie Val Thr Asn Trp Leu Ala Lys Val Asn Thr 
740 745 750 



Gin lie Asp Leu lie Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin 
755 760 765 



Ala Glu Ala Thr Lys Ala lie lie Asn Tyr Gin Tyr Asn Gin Tyr Thr 
770 775 780 



Glu Glu Glu Lys Asn Asn lie Asn Phe Asn He Asp Asp Leu Ser Ser 
785 790 795 800 



Lys Leu Asn Glu Ser He Asn Lys Ala Met lie Asn He Asn Lys Phe 
805 810 815 



Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr 
820 825 830 



Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
835 840 845 



Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp 
850 855 860 



Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe 
865 870 875 880 



Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 895 



<210> 33 

<211> 2709 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
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<400> 33 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 

attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 

cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 

ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 72 0 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 102 0 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 114 0 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa- atcagcgcgt 1380 

aaatatgcta accaggcgct agcgggcggt ggcggtagcg gcggtggcgg tagcggcggt 1440 

ggcggtagcg cactagtgct gcagtgtatc aaggttaaca actgggattt attcttcagc 1500 

ccgagtgaag acaacttcac caacgacctg aacaaaggtg aagaaatcac ctcagatact 1560 

aacatcgaag cagccgaaga aaacatctcg ctggacctga tccagcagta ctacctgacc 1620 

tttaatttcg acaacgagcc ggaaaacatt tctatcgaaa acctgagctc tgatatcatc 1680 

ggccagctgg aactgatgcc gaacatcgaa cgtttcccaa acggtaaaaa gtacgagctg 1740 

gacaaatata ccatgttcca ctacctgcgc gcgcaggaat ttgaacacgg caaatcccgt 1800 
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atcgcactga 


ctaactccgt 


taacgaagct ctgctcaacc cgtcccgtgt atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac aaagcgactg aagctgcaat gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc gaegagaegt ccgaagtatc tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg tacateggtc eggctctgaa cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca ctgatcttct ccggtgcggt gatcctgetg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta ctgggcacct ttgctctggt ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc gacaacgcgc tgagcaaacg taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tategtgace aactggctgg ctaaggttaa tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca ctggaaaacc aggeggaage taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc gaggaagaaa aaaacaacat caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa tccatcaaca aagctatgat caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat ctgatgaact ccatgatccc gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg aaagacgccc tgctgaaata catttacgac 


2580 


aaccguggca 


cue ugancgg 


tcaggttgat cgtctgaagg acaaagtgaa caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa tatgtcgata accaacgcct tttgtccact 


2700 


ctagactag 






2709 



<210> 34 

<211> 902 

<212>. PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 34 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 

Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val lie Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro. Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
• 245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 ' 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 

305 310 - . 315 ... 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Tyr Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He 
545 550 555 560 
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Gly Gin Leu Glu Leu Met Pro Asn lie Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 ~ 590 



Glu Phe Glu His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys lie Ala Asp lie Thr lie lie lie Pro Tyr lie 
660 665 670 



Gly Pro Ala Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu lie Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala lie Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu. Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
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805 810 815 

lie Asn lie Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 

Asn Ser Met lie Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 

Ser Leu Lys Asp Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr 
850 855 860 

Leu lie Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 

Ser Thr Asp lie Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 

Leu Leu Ser Thr Leu Asp 
900 

<210> 35 
<211> 2697 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400>. 35 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag 


acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc 


cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta 


ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt 


actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta 


ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg 


ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca 


ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg 


tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tettcaaagt 


taa~cScc'aac:.gdgtattacg 


agatgtccgg 


tctggaagtt 


780 
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agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaagcgctag 


cgggcggtgg 


cggtagcggc 


ggtggcggta 


gcggcggtgg 


cggtagcgca 


1440 


ctagtgctgc 


agtgtatcaa 


ggttaacaac 


tgggatttat 


tcttcagccc 


gagtgaagac 


1500 


aacttcacca 


acgacctgaa 


caaaggtgaa 


gaaatcacct 


cagatactaa 


catcgaagca 


1560 


gccgaagaaa 


acatctcgct 


ggacctgatc 


cagcagtact 


acctgacctt 


taatttcgac 


1620 


aacgagccgg 


aaaacatttc 


tatcgaaaac 


ctgagctctg 


atatcatcgg 


ccagctggaa 


1680 


ctgatgccga 


acatcgaacg 


tttcccaaac 


ggtaaaaagt 


acgagctgga 


caaatatacc 


1740 


atgttccact 


acctgcgcgc 


gcaggaattt 


gaacacggca 


aatcccgtat 


cgcactgact 


1800 


aactccgtta 


acgaagctct 


gctcaacccg 


tcccgtgtat 


acaccttctt 


ctctagcgac 


1860 


tacgtgaaaa 


aggtcaacaa 


agcgactgaa 


gctgcaatgt 


tcttgggttg 


ggttgaacag 


1920 


cttgtttatg 


attttaccga 


cgagacgtcc 


gaagtatcta 


ctaccgacaa 


aattgcggat 


1980 


atcactatca 


tcatcccgta 


catcggtccg 


gctctgaaca 


ttggcaacat 


gctgtacaaa 


2040 


gacgacttcg 


ttggcgcact 


gatcttctcc 


ggtgcggtga 


tcctgctgga 


gttcatcccg 


2100 


gaaatcgcca 


tcccggtact 


gggcaccttt 


gctctggttt 


cttacattgc 


aaacaaggtt 


2160 


ctgactgtac 


aaaccatcga 


caacgcgctg 


agcaaacgta 


acgaaaaatg 


ggatgaagtt 


2220 


tacaaatata 


tcgtgaccaa 


ctggctggct 


aaggttaata 


ctcagatcga 


cctcatccgc 


2280 


aaaaaaatga 


aagaagcact 


ggaaaaccag 


gcggaagcta 


ccaaggcaat 


cattaactac 


2340 


cagtacaacc 


agtacaccga 


ggaagaaaaa 


aacaacatca 


acttcaacat 


cgacgatctg 


2400 


tcctctaaac 


tgaacgaatc 


catcaacaaa 


gctatgatca 


acatcaacaa 


gttcctgaac 


2460 


cagtgctctg 


taagctatct 


gatgaactcc 


atgatcccgt 


acggtgttaa 


acgtctggag 


2520 


gacttcgatg 


cgtctctgaa 


agacgccctg 


ctgaaataca 


tttacgacaa 


ccgtggcact 


2580 
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ctgatcggtc aggttgatcg tctgaaggac aaagtgaaca ataccttatc gaccgacatc 2640 
ccttttcagc tcagtaaata tgtcgataac caacgccttt tgtccactct agactag 2697 



<210> 36 

<211> 898 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 36 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 ^ 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg. Asn Gly Tyr .Gly Ser Thr Gin -Tyr lie. Arg Phe Ser Pro 
180 185 190 
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Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu lie His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 " 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn He Val* Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 

420 . 425. ... — 430 



WO 2006/059093 



73 



PCT/GB2005/004585 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Ala Leu Ala 
450 455 460 



Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala 
465 470 475 480 



Leu Val Leu Gin Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe Ser 
485 490 495 



Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie 
500 505 510 



Thr Ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn lie Ser Leu Asp 
515 520 525 



Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu 
530 535 540 



Asn lie Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu 
545 550 555 560 



Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu 
565 570 575 



Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His 
580 585 590 



Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu 
595 600 605 



Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys 
610 615 620 



Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin 
625 630 635 640 



Leu. Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp 
645 650 655 



Lys He Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu 
660 665 670 
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Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie 
675 680 685 



Phe Ser Gly Ala Val lie Leu Leu Glu Phe lie Pro Glu lie Ala lie 
690 695 700 



Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr lie Ala Asn Lys Val 
705 710 715 720 



Leu Thr Val Gin Thr lie Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys 
725 730 735 



Trp Asp Glu Val Tyr Lys Tyr lie Val Thr Asn Trp Leu Ala Lys Val 
740 745 750 



Asn Thr Gin lie Asp Leu lie Arg Lys Lys Met Lys Glu Ala Leu Glu 
755 760 765 



Asn Gin Ala Glu Ala Thr Lys Ala lie lie Asn Tyr Gin Tyr Asn Gin 
770 775 780 



Tyr Thr Glu Glu Glu Lys Asn Asn lie Asn Phe Asn lie Asp Asp Leu 
785 790 795 800 



Ser Ser Lys Leu Asn Glu Ser lie Asn Lys Ala Met lie Asn lie Asn 
805 810 815 



Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met lie 
820 825 830 



Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp 
835 840 845 



Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr Leu lie Gly Gin 
850 855 860 



Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp lie 
865 870 875 880 



Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr 
885 890 895 



Leu Asp 



<210> 37 
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<211> 51 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 37 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca g 51 



<210> 38 

<211> 17 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 38 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
1 ^ 5 10 15 



Gin 



<210> 39 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 39 

tttggcggtt tcacgggcgc acgcaaatca gcg 33 



<210> 40 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 40 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala 
15 10 



<210> 41 

<211> 33 

<212> DNA 

<213> Artificial 



Sequence 



<220> 
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<223> Synthetic 



<400> 41 

tttggcggtt tcacgggcgc acgcaaatat gcg 



33 



<210> 42 
<211> 11 
<212> PRT . 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 42 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Tyr Ala 
15 10 



<210> 43 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 43 

tttggcggtt tcacgggcgc acgcaaatca tat 33 



<210> 44 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 45 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
,<220> 

<223> Synthetic 

<400> 45 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat atgctaacca g 51 



<400> 



44 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Tyr 
1 5 10 



<210> 46 
<211> 17 
<212> PRT 



WO 2006/059093 



77 



PCT/GB2005/004585 



<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 46 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Tyr Ala Asn 
15 10 15 



Gin 



<210> 47 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 47 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaa 39 



<210> 48 

<211> 13 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 48 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys 
1 5 10 



<210> 49 

<211> 51 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



Synthetic 



<400> 49 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gcaaaaacca g 



51 



<210> 50 
<211> 17 
<212> PRT 



<213> Artificial Sequence 



<220> 
<223> 



Synthetic 
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<400> 50 

Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
1 5 10 15 

Gin 

<210> 51 

<211> 2736 

<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Synthetic 

<400> 51 



ctcgggattg 


agggtcgttt 


tggcggtttc 


acgggcgcac 


gcaaatcagc 


gcgtaaatta 


60 


gctaaccaga 


ctagtggcgg 


tgggggtagt 


ggcggtggcg 


gttcgggcgg gggtgggagc 


120 


cctaggggat 


ccatggagtt 


cgttaacaaa 


cagttcaact 


ataaagaccc 


agttaacggt 


180 


gttgacattg 


cttacatcaa 


aatcccgaac 


gctggccaga 


tgcagccggt 


aaaggcattc 


240 


aaaatccaca 


acaaaatctg 


ggttatcccg 


gaacgtgata 


cctttactaa 


cccggaagaa 


300 


ggtgacctga 


acccgccacc 


ggaagcgaaa 


caggtgccgg 


tatcttacta 


tgactccacc 


360 


tacctgtcta 


ccgataacga 


aaaggacaac 


tacctgaaag 


gtgttactaa 


actgttcgag 


420 


cgtatttact 


ccaccgacct 


gggccgtatg 


ctgctgacta 


gcatcgttcg cggtatcccg 


480 


ttctggggcg 


gttctaccat 


cgataccgaa 


ctgaaagtaa 


tcgacactaa 


ctgcatcaac 


540 


gttattcagc 


cggacggttc 


ctatcgttcc 


gaagaactga 


acctggtgat 


catcggcccg 


600 


tctgctgata 


tcatccagtt 


cgagtgtaag 


agctttggtc 


acgaagttct 


gaacctcacc 


660 


cgtaacggct 


acggttccac 


tcagtacatc 


cgtttctctc 


cggacttcac 


cttcggtttt 


720 


gaagaatccc 


tggaagtaga 


cacgaaccca 


ctgctgggcg 


ctggtaaatt 


cgcaactgat 


780 


cctgcggtta 


ccctggctca 


cgaactgatt 


catgcaggcc 


accgcctgta 


cggtatcgcc 


840 


atcaatccga 


accgtgtctt 


caaagttaac 


accaacgcgt 


attacgagat gtccggtctg 


900 


gaagttagct 


tcgaagaact 


gcgtactttt 


ggcggtcacg 


acgctaaatt 


catcgactct 


960 


ctgcaagaaa 


acgagttccg 


tctgtactac 


tataacaagt 


tcaaagatat 


cgcatccacc 


1020 


ctgaacaaag 


cgaaatccat 


cgtgggtacc 


actgcttctc 


tccagtacat gaagaacgtt 


1080 


tttaaagaaa 


aatacctgct 


cagcgaagac 


acctccggca 


aattctctgt 


agacaagttg 


1140 


aaattcgata 


aactttacaa 


aatgctgact 


gaaatttaca 


ccgaagacaa cttcgttaag 


1200 


ttctttaaag 


ttctgaaccg 


caaaacctat 


ctgaacttcg 


acaaggcagt 


attcaaaatc 


1260 
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aacatcgtgc 


cgaaagttaa 


ctacactatc 


tacgatggtt 


tcaacctgcg 


taacaccaac 


1320 


ctggctgcta 


attttaacgg 


ccagaacacg 


gaaatcaaca 


acatgaactt 


cacaaaactg 


1380 


aaaaacttca 


ctggtctgtt 


cgagttttac 


aagctgctgt 


gcgtcgacgg 


catcattacc 


1440 


tccaaaacta 


aatctctgat 


agaaggtaga 


aacaaagcgc 


tgaacctgca gtgtatcaag 


1500 


gttaacaact 


gggatttatt 


cttcagcccg 


agtgaagaca 


acttcaccaa 


cgacctgaac 


1560 


aaaggtgaag 


aaatcacctc 


agatactaac 


atcgaagcag 


ccgaagaaaa 


catctcgctg 


1620 


gacctgatcc 


agcagtacta 


cctgaccttt 


aatttcgaca 


acgagccgga 


aaacatttct 


1680 


atcgaaaacc 


tgagctctga 


tatcatcggc 


cagctggaac 


tgatgccgaa 


catcgaacgt 


1740 


ttcccaaacg 


gtaaaaagta 


cgagctggac 


aaatatacca 


tgttccacta 


cctgcgcgcg 


1800 


caggaatttg 


aacacggcaa 


atcccgtatc 


gcactgacta 


actccgttaa 


cgaagctctg 


1860 


ctcaacccgt 


cccgtgtata 


caccttcttc 


tctagcgact 


acgtgaaaaa 


ggtcaacaaa 


1920 


gcgactgaag 


ctgcaatgtt 


cttgggttgg 


gttgaacagc 


ttgtttatga 


ttttaccgac 


1980 


gagacgtccg 


aagtatctac 


taccgacaaa 


attgcggata 


tcactatcat 


catcccgtac 


2040 


atcggtccgg 


ctctgaacat 


tggcaacatg 


ctgtacaaag 


acgacttcgt 


tggcgcactg 


2100 


atcttctccg 


gtgcggtgat 


cctgctggag 


ttcatcccgg 


aaatcgccat 


cccggtactg 


2160 


ggcacctttg 


ctctggtttc 


ttacattgca 


aacaaggttc 


tgactgtaca aaccatcgac 


2220 


aacgcgctga 


gcaaacgtaa 


cgaaaaatgg 


gatgaagttt 


acaaatatat 


cgtgaccaac 


2280 


tggctggcta 


aggttaatac 


tcagatcgac 


ctcatccgca 


aaaaaatgaa 


agaagcactg 


2340 


gaaaaccagg 


cggaagctac 


caaggcaatc 


attaactacc agtacaacca gtacaccgag 


2400 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt 


cctctaaact 


gaacgaatcc 


2460 


atcaacaaag 


ctatgatcaa 


catcaacaag 


ttcctgaacc 


agtgctctgt 


aagctatctg 


2520 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg 


acttcgatgc 


gtctctgaaa 


2580 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc 


tgatcggtca ggttgatcgt 


2640 


ctgaaggaca 


aagtgaacaa 


taccttatcg 


accgacatcc 


cttttcagct 


cagtaaatat 


2700 


gtcgataacc 


aacgcctttt 


gtccactcta 


gactag 






2736 



<210> 52 

<211> 911 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 52 



WO 2006/059093 



80 



PCT/GB2005/004585 



Leu Gly lie Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser 
1 5 10 15 



Ala Arg Lys Leu Ala Asn Gin Thr Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 



Gly Gly Ser Gly Gly Gly Gly Ser Pro Arg Gly Ser Met Glu Phe Val 
35 40 ' 45 



Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly Val Asp lie Ala 
50 55 60 



Tyr lie Lys lie Pro Asn Ala Gly Gin Met Gin Pro Val Lys Ala Phe 
65 70 75 80 



Lys lie His Asn Lys lie Trp Val lie Pro Glu Arg Asp Thr Phe Thr 
85 90 95 



Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala Lys Gin Val 
100 105 110 



Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp Asn Glu Lys 
115 120 125 



Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu Arg lie Tyr Ser 
130 135 140 



Thr Asp Leu Gly Arg Met Leu Leu Thr Ser lie Val Arg Gly lie Pro 
145 150 155 160 



Phe Trp Gly Gly Ser Thr lie Asp Thr Glu Leu Lys Val lie Asp Thr 
165 170 175 



Asn Cys lie Asn Val lie Gin Pro Asp Gly Ser Tyr Arg Ser Glu Glu 
180 185 190 



Leu. Asn Leu Val lie lie Gly Pro Ser Ala Asp He He Gin Phe Glu 
195 200 205 



Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr Arg Asn Gly Tyr 
210 215 220 



Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe Thr Phe Gly Phe 
225 230 235 240 



Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly Ala Gly Lys 
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245 250 255 



Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu Leu He His Ala 
260 265 270 



Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn Arg Val Phe Lys 
275 280 285 



Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu Val Ser Phe 
290 295 300 



Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe He Asp Ser 
305 310 315 320 



Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys Phe Lys Asp 
325 330 335 



lie Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val Gly Thr Thr Ala 
340 345 350 



Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr Leu Leu Ser 
355 360 365 



Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys Phe Asp Lys 
370 375 380 



Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp Asn Phe Val Lys 
385 390 395 400 



Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe Asp Lys Ala 
405 410 415 



Val Phe Lys He Asn He Val Pro Lys Val Asn Tyr Thr He Tyr Asp 
420 425 430 



Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe Asn Gly Gin 
435 440 445 



Asn Thr Glu He Asn Asn Met Asn Phe Thr Lys Leu Lys Asn Phe Thr 
450 455 460 



Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly He He Thr 
465 470 475 480 



Ser Lys Thr Lys Ser Leu He Glu Gly Arg Asn Lys Ala Leu Asn Leu 

- -iz V. •» 48S/.:- * • -490 - ; 495 
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Gin Cys lie Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie Thr Ser Asp 
515 520 525 



Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin 
530 535 540 



Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser 
545 550 555 560 



He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 605 



Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys 
625 630 635 640 



Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala 
660 665 670 



Asp He Thr He He He Pro Tyr He Gly Pro Ala" Leu" Asn He Gly 
675 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly 
690 ~ 695 700 



Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys Val Leu Thr Val 
725 . . 730 735 
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Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
770 775 780 



Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 825 830 



Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val Asp Arg 
865 870 875 880 



Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro Phe Gin 
885 890 895 



Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 



<210> 


53 


<211> 


177 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


53 



ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggtggtggtg gttctgcact agtgctgcag acgcacggtc tagaatgata aaagctt 177 
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<210> 54 
<211> 192 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 54 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggtggtggtg gttctggtgg tggtggttct gcactagtgc tgcagacgca cggtctagaa 180 
tgataaaagc tt 192 



<210> 55 
<211> 222 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 55 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 
ggtggtggtg gttctggtgg tggtggttct ggtggtggtg gttctggtgg tggtggttct 180 
gcactagtgc tgcagacgca cggtctagaa tgataaaagc tt 222 



<210> 56 
<211> 237 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 56 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 
ggtggtggtg gttctggtgg tggtggttct ggtggtggtg gttctggtgg tggtggttct 
ggtggtggtg gttctgcact agtgctgcag acgcacggtc tagaatgata aaagctt 



<210> 57 

<211> 228 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



60 
120 
180 
237 
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<400> 57 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 

tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaat tagctaacca ggcgctagcg 120 

gctgaagctg ctgctaaaga agctgctgct aaagaagctg ctgctaaagc tggtggcggt 180 

ggttccgcac tagtgctgca gacgcacggt ctagaatgat aaaagctt 228 

<210> 58 
<211> 2694 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 58 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 

attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 

cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 

ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 

tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 72 0 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 
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ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatttggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggtggt 


ggtggttctg 


gtggtggtgg 


ttctgcacta 


1440 


gtgctgcagt 


gtatcaaggt 


taacaactgg 


gatttattct 


tcagcccgag 


tgaagacaac 


1500 


ttcaccaacg 


acctgaacaa 


aggtgaagaa 


atcacctcag 


atactaacat 


cgaagcagcc 


1560 


gaagaaaaca 


tctcgctgga 


cctgatccag 


cagtactacc 


tgacctttaa 


tttcgacaac 


1620 


gagccggaaa 


acatttctat 


cgaaaacctg 


agctctgata 


tcatcggcca 


gctggaactg 


1680 


atgccgaaca 


tcgaacgttt 


cccaaacggt 


aaaaagtacg 


agctggacaa 


atataccatg 


1740 


ttccactacc 


tgcgcgcgca 


ggaatttgaa 


cacggcaaat 


cccgtatcgc 


actgactaac 


1800 


tccgttaacg 


aagctctgct 


caacccgtcc 


cgtgtataca 


ccttcttctc 


tagcgactac 


1860 


gtgaaaaagg 


tcaacaaagc 


gactgaagct 


gcaatgttct 


tgggttgggt 


tgaacagctt 


1920 


gtttatgatt 


ttaccgacga 


gacgtccgaa 


gtatctacta 


ccgacaaaat 


tgcggatatc 


1980 


actatcatca 


tcccgtacat 


cggtccggct 


ctgaacattg 


gcaacatgct 


gtacaaagac 


2040 


gacttcgttg 


gcgcactgat 


cttctccggt 


gcggtgatcc 


tgctggagtt 


catcccggaa 


2100 


atcgccatcc 


cggtactggg 


cacctttgct 


ctggtttctt 


acattgcaaa 


caaggttctg 


2160 


actgtacaaa 


ccatcgacaa 


cgcgctgagc 


aaacgtaacg 


aaaaatggga 


tgaagtttac 


2220 


aaatatatcg 


tgaccaactg 


gctggctaag 


gttaatactc 


agatcgacct 


catccgcaaa 


2280 


aaaatgaaag 


aagcactgga 


aaaccaggcg 


gaagctacca 


aggcaatcat 


taactaccag 


2340 


tacaaccagt 


acaccgagga 


agaaaaaaac 


aacatcaact 


tcaacatcga 


cgatctgtcc 


2400 


tctaaactga 


acgaatccat 


caacaaagct 


atgatcaaca 


tcaacaagtt 


cctgaaccag 


2460 


tgctctgtaa 


gctatctgat 


gaactccatg 


atcccgtacg 


gtgttaaacg 


tctggaggac 


2520 


ttcgatgcgt 


ctctgaaaga 


cgccctgctg 


aaatacattt 


acgacaaccg 


tggcactctg 


2580 


atcggtcagg 


ttgatcgtct 


gaaggacaaa 


gtgaacaata 


ccttatcgac 


cgacatccct 


2640 


tttcagctca 


gtaaatatgt 


cgataaccaa 


cgccttttgt 


ccactctaga 


ctag 


2694 



<210> 59 

<211> 897 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 59 



-€3rys0eac Mefe-Glu- Phe Val Asir hys Gin Phe Asn Tyr Lys Asp Pro Val * 
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10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr lie Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 

z.i . ^ .? .245: r - r 250 *>• ~ 255 
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Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 " 430 



Val Asp Gly. He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu 
465 470 475 480 



Val Leu Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro 
485 490. 495 
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Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie Thr 
500 505 510 



Ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu 
515 520 525 



He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn 
530 535 540 



He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu 
545 550 555 560 



Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp 
565 570 575 



Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly 
580 585 590 



Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn 
595 600 605 



Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val 
610 615 620 



Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu 
625 630 635 640 



Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys 
645 650 655 



He Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn 
660 665 670 



He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu He Phe 
675 680 685 



Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala He Pro 
690 695 700 



Val Leu Gly Thr Phe Ala Leu Val Ser Tyr lie Ala Asn Lys Val Leu 
705 710 715 720 



Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp 
725 730 735 
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Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys Val Asn 
740 745 750 



Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu Glu Asn 
755 760 " 765 



Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn Gin Tyr 
770 775 780 



Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser 
785 790 795 800 



Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys 
805 810 815 



Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro 
820 825 830 



Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala 
835 840 845 



Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly Gin Val 
850 855 860 



Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He Pro 
865 870 875 880 



Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu 
885 890 895 



Asp 



<210> 60 
<211> 2724 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 60 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 
cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 
ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 
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tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 

tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 

ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 

cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 

gatatcatcc agttcgagtg taagagcttt ggtcacgaag ttctgaacct cacccgtaac 540 

ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg ttttgaagaa 600 

tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac tgatcctgcg 660 

gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat cgccatcaat 720 

ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg tctggaagtt 780 

agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga ctctctgcaa 840 

gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc caccctgaac 900 

aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa cgtttttaaa 960 

gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa gttgaaattc 1020 

gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt taagttcttt 1080 

aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 1140 

gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac caacctggct 1200 

gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa actgaaaaac 1260 

ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 1320 

actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa atcagcgcgt 1380 

aaattagcta accaggcgct agcgggtggt ggtggttctg gtggtggtgg ttctggtggt 1440 

ggtggttctg gtggtggtgg ttctgcacta gtgctgcagt gtatcaaggt taacaactgg 1500 

gatttattct tcagcccgag tgaagacaac ttcaccaacg acctgaacaa aggtgaagaa 1560 

atcacctcag atactaacat cgaagcagcc gaagaaaaca tctcgctgga cctgatccag 1620 

cagtactacc tgacctttaa tttcgacaac gagccggaaa acatttctat cgaaaacctg 1680 

agctctgata tcatcggcca gctggaactg atgccgaaca tcgaacgttt cccaaacggt 1740 

aaaaagtacg agctggacaa atataccatg ttccactacc tgcgcgcgca ggaatttgaa 1800 

cacggcaaat cccgtatcgc actgactaac tccgttaacg aagctctgct caacccgtcc 1860 

cgtgtataca ccttcttctc tagcgactac gtgaaaaagg tcaacaaagc gactgaagct 1920 

gcaatgttct tgggttgggt tgaacagctt gtttatgatt ttaccgacga gacgtccgaa 1980 

gtatctacta ccgacaaaat tgcggatatc actatcatca tcccgtacat cggtccggct 2040 

ctgaacattg gcaacatgct gtacaaagac gacttcgttg gcgcactgat cttctccggt 2100 
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gcggtgatcc 


tgctggagtt 


catcccggaa 


atcgccatcc cggtactggg 


cacctttgct 


2160 


ctggtttctt 


acattgcaaa 


caaggttctg 


actgtacaaa ccatcgacaa 


cgcgctgagc 


2220 


aaacgtaacg 


aaaaatggga 


tgaagtttac aaatatatcg tgaccaactg gctggctaag 


2280 


gttaatactc 


agatcgacct 


catccgcaaa 


aaaatgaaag aagcactgga 


aaaccaggcg 


2340 


gaagctacca 


aggcaatcat 


taactaccag 


tacaaccagt acaccgagga 


agaaaaaaac 


2400 


aacatcaact 


tcaacatcga 


cgatctgtcc 


tctaaactga acgaatccat 


caacaaagct 


2460 


atgatcaaca 


tcaacaagtt 


cctgaaccag tgctctgtaa gctatctgat 


gaactccatg 


2520 


atcccgtacg 


gtgttaaacg 


tctggaggac 


ttcgatgcgt ctctgaaaga 


cgccctgctg 


2580 


aaatacattt 


acgacaaccg 


tggcactctg 


atcggtcagg ttgatcgtct 


gaaggacaaa 


2640 


gtgaacaata 


ccttatcgac 


cgacatccct 


tttcagctca gtaaatatgt 


cgataaccaa 


2700 


cgccttttgt 


ccactctaga 


ctag 






2724 



<210> 61 

<211> 907 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 61 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 



Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 * 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 .70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 - 110 
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He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 " 160 



Asp He lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300* 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 
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Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 . . 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys 
485 490 ' 495 



Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr 
500 505 510 



Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp Thr Asn He Glu 
515 520 525 



Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin Gin Tyr Tyr Leu 
530 535 540 



Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser He Glu Asn Leu 
545 550 555 560 



Ser Ser Asp lie lie Gly Gin Leu Glu Leu Met Pro Asn lie Glu Arg 
565 570 575 



Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His 
580 585 590 



Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser Arg lie Ala Leu 
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595 600 605 



Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr 
610 615 620 



Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala 
625 630 635 640 



Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp 
645 650 655 



Glu Thr Ser Glu Val Ser Thr Thr Asp Lys lie Ala Asp lie Thr lie 
660 665 670 



lie lie Pro Tyr lie Gly Pro Ala Leu Asn lie Gly Asn Met Leu Tyr 
675 680 685 



Lys Asp Asp Phe Val Gly Ala Leu lie Phe Ser Gly Ala Val lie Leu 
690 695 700 



Leu Glu Phe lie Pro Glu lie Ala lie Pro Val Leu Gly Thr Phe Ala 
705 710 715 720 



Leu Val Ser Tyr lie Ala Asn Lys Val Leu Thr Val Gin Thr lie Asp 
725 730 735 



Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr 
740 745 750 



lie Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin lie Asp Leu He 
755 760 765 



Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys 
770 775 780 



Ala He He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn 
785 790 795 800 



Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser 
805 810 815 



He Asn Lys Ala Met He Asn He Asn Lys Phe Leu Asn Gin Cys Ser 
820 825 830 



Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu 
. 865 - - -~. -840 c <• 845 
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Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr lie Tyr 
850 855 860 



Asp Asn Arg Gly Thr Leu lie Gly Gin Val Asp Arg Leu Lys Asp Lys 
865 870 875 880 



Val Asn Asn Thr Leu Ser Thr Asp lie Pro Phe Gin Leu Ser Lys Tyr 
885 890 895 



Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 



<210> 62 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223 > Synthetic 
<400> 62 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctgacga tgacgataaa 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gtaagaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 



<210> 63 
<211> 2709 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 63 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 
cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 
ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 
tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 
tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 3 60 
ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 420 
cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 
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gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag 


ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattegcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtaeggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtcegg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atategcate 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgegtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


aeggcatcat 


tacctccaaa 


1320 


actaaatctg 


acgatgacga 


taaatttggc 


ggtttcacgg 


gcgcacgcaa 


ateagegegt 


1380 


aaacgtaaga 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tageggeggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


aeggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacaegg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc 


eggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgetg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


acaaacaaao 


fcfccfcciacfccffc 


araaarratr 


era c* a a c err* cr c 


t* cr a cr c a a a cer 


t-cici^ycictciclct 


Z Z Z V 


tgggatgaag 


tttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca 


ctggaaaacc 


aggeggaage 


taccaaggca 


2340 
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atcattaact accagtacaa ccagtacacc gaggaagaaa aaaacaacat caacttcaac 2400 

atcgacgatc tgtcctctaa actgaacgaa tccatcaaca aagctatgat caacatcaac 2460 

aagttcctga accagtgctc tgtaagctat ctgatgaact ccatgatccc gtacggtgtt 2520 

aaacgtctgg aggacttcga tgcgtctctg aaagacgccc tgctgaaata catttacgac 2580 

aaccgtggca ctctgatcgg tcaggttgat cgtctgaagg acaaagtgaa caatacctta 2640 

tcgaccgaca tcccttttca gctcagtaaa tatgtcgata accaacgcct tttgtccact 2700 
ctagactag 



<210> 64 

<211> 902 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 64 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 is 

Asn Gly Val Asp He Ala Tyr lie Lys He Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He- His Asn Lys He Trp Val He Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 .55 60 

Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 

Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 

Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 no 

He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 

Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



2709 
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Ser Tyr Arg Ser Glu GiiTLeu" Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 " 310 315 ' 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu. Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 ^ " 375 380 
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Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 " " 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 „ . . 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp Lys 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn 
515 520 525 



lie Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn lie Ser lie Glu Asn Leu Ser Ser Asp lie lie 
545 550 555 ~ 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 • 



Lys - Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr 'Glu Ala Ala Met Phe Leu Gly 
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625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 

865 ■ ■ *- ~-8-70~r ; 875" * ' 880 



WO 2006/059093 Best Available Copy 1Q2 



PCT/GB2005/004585 



Ser Thr Asp lie Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 65 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 65 

ggatccacgc acgtcgacgg catcattacc tccaaaacta aatctctgat agaaggtaga 60 
tttggcggtt tcacgggcgc acgcaaatca gcgcgtaaac gtaagaacca ggcgctagcg 120 
ggcggtggcg gtagcggcgg tggcggtagc ggcggtggcg gtagcgcact agtgctgcag 180 
acgcacggtc tagaatgata aaagctt 207 



<210> 66 
<211> 2742 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 66 

ggatccgaat tcatgccgat caccatcaac aacttcaact acagcgatcc ggtggataac 60 

aaaaacatcc tgtacctgga tacccatctg aataccctgg cgaacgaacc ggaaaaagcg 120 

tttcgtatca ccggcaacat ttgggttatt ccggatcgtt ttagccgtaa cagcaacccg 180 

aatctgaata aaccgccgcg tgttaccagc ccgaaaagcg gttattacga tccgaactat 240 

ctgagcaccg atagcgataa agataccttc ctgaaagaaa tcatcaaact gttcaaacgc 3 00 

atcaacagcc gtgaaattgg cgaagaactg atctatcgcc tgagcaccga tattccgttt 360 

ccgggcaaca acaacacccc gatcaacacc tttgatttcg atgtggattt caacagcgtt 42 0 

gatgttaaaa cccgccaggg taacaattgg gtgaaaaccg gcagcattaa cccgagcgtg 480 

attattaccg gtccgcgcga aaacattatt gatccggaaa ccagcacctt taaactgacc 540 

aacaacacct ttgcggcgca ggaaggtttt ggcgcgctga gcattattag cattagcccg 600 

cgctttatgc tgacctatag caacgcgacc aacgatgttg gtgaaggccg tttcagcaaa 660 

agcgaatttt gcatggaccc gatcctgatc ctgatgcatg aactgaacca tgcgatgcat 720 
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aacctgtatg gcatcgcgat tccgaacgat cagaccatta gcagcgtgac cagcaacatc 780 

ttttacagcc agtacaacgt gaaactggaa tatgcggaaa tctatgcgtt tggcggtccg 840 

accattgatc tgattccgaa aagcgcgcgc aaatacttcg aagaaaaagc gctggattac 900 

tatcgcagca ttgcgaaacg tctgaacagc attaccaccg cgaatccgag cagcttcaac 960 

aaatatatcg gcgaatataa acagaaactg atccgcaaat atcgctttgt ggtggaaagc 1020 

agcggcgaag ttaccgttaa ccgcaataaa ttcgtggaac tgtacaacga actgacccag 1080 

atcttcaccg aatttaacta tgcgaaaatc tataacgtgc agaaccgtaa aatctacctg 1140 

agcaacgtgt ataccccggt gaccgcgaat attctggatg ataacgtgta cgatatccag 1200 

aacggcttta acatcccgaa aagcaacctg aacgttctgt ttatgggcca gaacctgagc 1260 

cgtaatccgg cgctgcgtaa agtgaacccg gaaaacatgc tgtacctgtt caccaaattt 1320 

tgcgtcgacg gcatcattac ctccaaaact aaatctctga tagaaggtag atttggcggt 1380 

ttcacgggcg cacgcaaatc agcgcgtaaa cgtaagaacc aggcgctagc gggcggtggc 1440 

ggtagcggcg gtggcggtag cggcggtggc ggtagcgcac tagtgctgca gtgtcgtgaa 1500 

ctgctggtga aaaacaccga tctgccgttt attggcgata tcagcgatgt gaaaacqgat 1560 

atcttcctgc gcaaagatat caacgaagaa accgaagtga tctactaccc ggataacgtg 1620 

agcgttgatc aggtgatcct gagcaaaaac accagcgaac atggtcagct ggatctgctg 1680 

tatccgagca ttgatagcga aagcgaaatt ctgccgggcg aaaaccaggt gttttacgat 1740 

aaccgtaccc agaacgtgga ttacctgaac agctattact acctggaaag ccagaaactg 1800 

agcgataacg tggaagattt tacctttacc cgcagcattg aagaagcgct ggataacagc 1860 

gcgaaagttt acacctattt tccgaccctg gcgaacaaag ttaatgcggg tgttcagggc 192 0 

ggtctgtttc tgatgtgggc gaacgatgtg gtggaagatt tcaccaccaa catcctgcgt 1980 

aaagataccc tggataaaat cagcgatgtt agcgcgatta ttccgtatat tggtccggcg 204 0 

ctgaacatta gcaatagcgt gcgtcgtggc aattttaccg aagcgtttgc ggttaccggt 2100 

gtgaccattc tgctggaagc gtttccggaa ■ tttaccattc cggcgctggg tgcgtttgtg 2160 

atctatagca a&gtgcagga acgcaacgaa atcatcaaaa ccatcgataa ctgcctggaa 2220 

cagcgtatta aacgctggaa agatagctat gaatggatga tgggcacctg gctgagccgt 2280 

attatcaccc agttcaacaa catcagctac cagatgtacg atagcctgaa ctatcaggcg 2340 

ggtgcgatta aagcgaaaat cgatctggaa tacaaaaaat acagcggcag cgataaagaa 2400 

aacatcaaaa gccaggttga aaacctgaaa aacagcctgg atgtgaaaat tagcgaagcg 2460 

atgaataaca tcaacaaatt catccgcgaa tgcagcgtga cctacctgtt caaaaacatg 2520 

ctgccgaaag tgatcgatga actgaacgaa tttgatcgca acaccaaagc gaaactgatc 2580 
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aacctgatcg atagccacaa cattattctg gtgggcgaag tggataaact gaaagcgaaa 2640 
gttaacaaca gcttccagaa caccatcccg tttaacatct tcagctatac caacaacagc 2700 
ctgctgaaag atatcatcaa cgaatacttc aatctagact ag 2742 



<210> 


67 


<211> 


913 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


67 



Gly Ser Glu Phe Met Pro lie Thr lie Asn Asn Phe Asn Tyr Ser Asp 
15 10 15 



Pro Val Asp Asn Lys Asn lie Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg lie Thr Gly Asn lie Trp 
35 40 45 



Val lie Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu lie lie Lys 
85 90 95 



Leu Phe Lys Arg lie Asn Ser Arg Glu lie Gly Glu Glu Leu lie Tyr 
100 105 110 



Arg Leu Ser Thr Asp lie Pro Phe Pro Gly Asn Asn Asn Thr Pro lie 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
13 0 135 140 



Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val 
145 150 155 160 



He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
. . 165 170 175 
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Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser lie He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 230 235 240 



Asn Leu Tyr Gly He Ala He Pro Asn Asp Gin Thr He Ser Ser Val 
245 250 255 



Thr Ser Asn He Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He 
290 295 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr He Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp lie Gin 
385 390 395 ^ 400 



Asn Gly Phe Asn He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 
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Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 ^ 430 



Met Leu Tyr Leu Phe Thr Lys Phe Cys Val Asp Gly lie lie Thr Ser 
435 440 445 



Lys Thr Lys Ser Leu lie Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala 
450 455 460 



Arg Lys Ser Ala Arg Lys Arg Lys Asn Gin Ala Leu Ala Gly Gly Gly 
465 470 475 480 



Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu 
485 490 495 



Gin Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe lie Gly 
500 505 510 



Asp lie Ser Asp Val Lys Thr Asp lie Phe Leu Arg Lys Asp lie Asn 
515 520 525 



Glu Glu Thr Glu Val lie Tyr Tyr Pro Asp Asn Val Ser Val Asp Gin 
530 535 540 



Val lie Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu Asp Leu Leu 
545 550 555 560 



Tyr Pro Ser lie Asp Ser Glu Ser Glu lie Leu Pro Gly Glu Asn Gin 
565 570 575 



Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu Asn Ser Tyr 
580 585 590 



Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu Asp Phe Thr 
595 600 605 



Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala Lys Val Tyr 
610 615 620 



Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val Gin Gly 
625 630 635 640 



Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp Phe Thr Thr 
645 650 655 



Asn He Leu Arg Lys Asp Thr. Leu Asp Lys He Ser Asp Val Ser Ala 
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660 665 670 



He He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn Ser Val Arg 
675 680 685 



Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val Thr He Leu 
690 695 700 



Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly Ala Phe Val 
705 710 715 720 



He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys Thr He Asp 
725 730 735 



Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser Tyr Glu Trp 
740 745 750 



Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe Asn Asn He 
755 760 765 



Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly Ala He Lys 
770 775 780 



Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu 
785 790 795 * 800 



Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys 
805 810 815 



He Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser 
820 825 830 



Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He Asp Glu Leu 
835 840 845 



Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu He Asn Leu He Asp 
850 855 860 



Ser His Asn He He Leu Val Gly Glu Val Asp Lys Leu Lys Ala Lys 
865 870 875 880 



Val Asn Asn Ser Phe Gin Asn Thr lie Pro Phe Asn He Phe Ser Tyr 
885 890 895 



Thr Asn Asn Ser Leu Leu Lys Asp He lie Asn Glu Tyr Phe Asn Leu 

— ~90O :: . • « r < c .v> 905 *x - : ■■ 910 
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Asp 



<210> 68 
<211> 2673 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic 










<400> bo 

ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtxuc 


tctccggact tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccacugcug 


ggcgctggta aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gau ucacgca 


ggccaccgcc tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


CaaCaCCaaC 


gcgtattacg agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


LLLLgycyyt 


cacgacgcta aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 




aagttcaaag atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacacccraao acaacttcat 


taaattcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatacggt 


ggtttcctgg cgctagcggg 


cggtggcggt 


1380 
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agcggcggtg gcggtagcgg 


cggtggcggt 


agcgcactag tgctgcagtg tatcaaggtt 


1440 


aacaactggg 


atttattctt 


cagcccgagt 


gaagacaact tcaccaacga cctgaacaaa 


1500 


ggtgaagaaa tcacctcaga 


tactaacatc 


gaagcagccg aagaaaacat ctcgctggac 


1560 


ctgatccagc 


agtactacct 


gacctttaat 


ttcgacaacg agccggaaaa catttctatc 


1620 


gaaaacctga gctctgatat 


catcggccag 


ctggaactga tgccgaacat cgaacgtttc 


1680 


ccaaacggta 


aaaagtacga 


gctggacaaa 


tataccatgt tccactacct gcgcgcgcag 


1740 


gaatttgaac 


acggcaaatc 


ccgtatcgca 


ctgactaact ccgttaacga agctctgctc 


1800 


aacccgtccc 


gtgtatacac 


cttcttctct 


agcgactacg tgaaaaaggt caacaaagcg 


1860 


actgaagctg 


caatgttctt 


gggttgggtt 


gaacagcttg tttatgattt taccgacgag 


1920 


acgtccgaag 


tatctactac 


cgacaaaatt 


gcggatatca ctatcatcat cccgtacatc 


1980 


ggtccggctc 


tgaacattgg 


caacatgctg 


tacaaagacg acttcgttgg cgcactgatc 


2040 


ttctccggtg 


cggtgatcct 


gctggagttc 


atcccggaaa tcgccatccc ggtactgggc 


2100 


acctttgctc 


tggtttctta 


cattgcaaac 


aaggttctga ctgtacaaac catcgacaac 


2160 


gcgctgagca 


aacgtaacga 


aaaatgggat 


gaagtttaca aatatatcgt gaccaactgg 


2220 


ctggctaagg 


ttaatactca 


gatcgacctc 


atccgcaaaa aaatgaaaga agcactggaa 


2280 


aaccaggcgg aagctaccaa 


ggcaatcatt 


aactaccagt acaaccagta caccgaggaa 


2340 


gaaaaaaaca 


acatcaactt 


caacatcgac 


gatctgtcct ctaaactgaa cgaatccatc 


2400 


aacaaagcta 


tgatcaacat 


caacaagttc 


ctgaaccagt gctctgtaag ctatctgatg 


2460 


aactccatga 


tcccgtacgg 


tgttaaacgt 


ctggaggact tcgatgcgtc tctgaaagac 


2520 


gccctgctga aatacattta 


*-*y c*.\-» u,u\^ y »— 


yyuau uyct ttyytcayy t uyai.cgL.Cug 


O EZ Q n 


aaggacaaag 


tgaacaatac 


cttatcgacc 


gacatccctt ttcagctcag taaatatgtc 


2640 


gataaccaac 


gccttttgtc 


cactctagac 


tag 


2673 



<210> 69 

<211> 890 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 69 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys He Pro Asn Ala, ..Gly Gin Met 
> . 20 25 30 
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Gin Pro Val Lys Ala Phe Lys lie His Asn Lys lie Trp Val lie Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 120 125 



Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265. ....... 270 
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Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Leu Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly 
450 455 460 



Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val 
465 470 475 480 



Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn 
485 490 495 



Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala 
500 505 510 
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Ala Glu Glu Asn lie Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu Thr 
515 520 525 



Phe Asn Phe Asp Asn Glu Pro Glu Asn lie Ser lie Glu Asn Leu Ser 
530 535 540 



Ser Asp lie He Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe 
545 550 555 560 



Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr 
565 570 575 



Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr 
580 585 590 



Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe 
595 600 605 



Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala 
610 615 620 



Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu 
625 630 635 640 



Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala Asp He Thr He He 
645 650 655 



He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys 
660 665 670 



Asp Asp Phe Val Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu 
675 680 685 



Glu Phe He Pro Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu 
690 695 700 



Val Ser Tyr He Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn 
705 710 715 720 



Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He 
725 730 735 



Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg 
740 745 750 



Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala 
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755 760 765 



lie lie Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn 
770 775 780 



lie Asn Phe Asn lie Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser lie 
785 790 795 800 



Asn Lys Ala Met lie Asn lie Asn Lys Phe Leu Asn Gin Cys Ser Val 
805 810 815 



Ser Tyr Leu Met Asn Ser Met lie Pro Tyr Gly Val Lys Arg Leu Glu 
820 825 * " 830 



Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr lie Tyr Asp 
835 840 845 



Asn Arg Gly Thr Leu lie Gly Gin Val Asp Arg Leu Lys Asp Lys Val 
850 855 860 



Asn Asn Thr Leu Ser Thr Asp lie Pro Phe Gin Leu Ser Lys Tyr Val 
865 870 875 ~ 880 



Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
885 890 



<210> 70 
<211> 2709 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 70 

ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa cggtgttgac 60 
attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 120 
cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga agaaggtgac 180 
ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc cacctacctg 240 
tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 300 
tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat cccgttctgg 360 
ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 42 0 
cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 480 
gatatrcatcc agttcgagtg taagagcttt: ggtcacgaag ttctgaacct cacccgtaac 540 
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ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact 


tcaccttcgg 


ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta 


aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc 


tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg 


agatgtccgg 


tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta 


aattcatcga 


ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag 


atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt 


acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct 


ctgtagacaa 


gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatatggc 


ggtttcacgg 


gcgcacgcaa 


atcagcgcgt 


1380 


aaattagcta 


accaggcgct 


agcgggcggt 


ggcggtagcg 


gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc 


gacaacgcgc 


tgagcaaacg 


taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca 


ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 
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atcattaact 


accagtacaa 


ccagtacacc 


gaggaagaaa aaaacaacat 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca aagctatgat 


caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat 


ctgatgaact ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg 


aaagacgccc tgctgaaata 


catttacgac 


2580 


aaccgtggca 


ctctaatccrcr 


tcaggttgat 


cgtctgaagg acaaagtgaa caatacctta 


O *± \J 


tcgaccgaca 


tcccttttca 


gctcagtaaa tatgtcgata accaacgcct 


tttgtccact 


2700 


ctagactag 










2709 



<210> 


71 


<211> 


902 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


71 



Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
1 5 10 15 



Asn Gly Val Asp lie Ala Tyr lie Lys lie Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 ~ 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 
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Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
. 180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys. Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
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385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Leu Ala Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys He Lys Val Asn Asn Trp Asp 
485 490 495 



Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn 
515 520 525 



He Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He lie 
545 550 555 ~ 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 

625 — 5 ■ -n630 u v ^ '.;r / ©35 - <~ ' * 640 
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Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys lie Ala Asp lie Thr lie lie lie Pro Tyr lie 
660 665 670 



Gly Pro Ala Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu lie Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690. 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin lie Asp Leu He Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn lie Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 

865 . 870 ...„. 875 _. 880 



WO 2006/059093 Best Available Co Py 1 19 



PCT/GB2005/004585 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 

Leu Leu Ser Thr Leu Asp 
900 

<210> 72 
<211> 2709 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 72 



ggatccatgg 


agttcgttaa 


caaacagttc aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg ccggtatctt actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg actagcatcg ttcgcggtat cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa gtaatcgaca ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa ctgaacctgg tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt ggtcacgaag ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


utuoi.LyuLy yy t*yt- t-yy t.a aatucyCaaC LyatCCtycy 


bou 


gttaccctgg 


ctcacgaact 


gattcatgca ggccaccgcc tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac gcgtattacg agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt cacgacgcta aattcatcga ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac aagttcaaag atatcgcatc caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct tctctccagt acatgaagaa cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc ggcaaattct ctgtagacaa gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt tacaccgaag acaacttcgt taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac ttcgacaagg cagtattcaa aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat ggtttcaacc tgcgtaacac caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc aacaacatga acttcacaaa actgaaaaac 


1260 


ttcactggtc* tgttcgagtt 


ttacaagctg ctgtgcgtcg acggcatcat tacctccaaa 


1320 
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actaaatctc 


tgatagaagg 


tagatatggc 


ggtttcacgg gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct 


agcgggcggt 


ggcggtagcg gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtatc 


aaggttaaca 


actgggattt 


attcttcagc 


1500 


ccgagtgaag 


acaacttcac 


caacgacctg 


aacaaaggtg 


aagaaatcac 


ctcagatact 


1560 


aacatcgaag 


cagccgaaga 


aaacatctcg 


ctggacctga 


tccagcagta 


ctacctgacc 


1620 


tttaatttcg 


acaacgagcc 


ggaaaacatt 


tctatcgaaa 


acctgagctc 


tgatatcatc 


1680 


ggccagctgg 


aactgatgcc 


gaacatcgaa 


cgtttcccaa 


acggtaaaaa 


gtacgagctg 


1740 


gacaaatata 


ccatgttcca 


ctacctgcgc 


gcgcaggaat 


ttgaacacgg 


caaatcccgt 


1800 


atcgcactga 


ctaactccgt 


taacgaagct 


ctgctcaacc 


cgtcccgtgt 


atacaccttc 


1860 


ttctctagcg 


actacgtgaa 


aaaggtcaac 


aaagcgactg 


aagctgcaat 


gttcttgggt 


1920 


tgggttgaac 


agcttgttta 


tgattttacc 


gacgagacgt 


ccgaagtatc 


tactaccgac 


1980 


aaaattgcgg 


atatcactat 


catcatcccg 


tacatcggtc 


cggctctgaa 


cattggcaac 


2040 


atgctgtaca 


aagacgactt 


cgttggcgca 


ctgatcttct 


ccggtgcggt 


gatcctgctg 


2100 


gagttcatcc 


cggaaatcgc 


catcccggta 


ctgggcacct 


ttgctctggt 


ttcttacatt 


2160 


gcaaacaagg 


ttctgactgt 


acaaaccatc 


gacaacgcgc 


tgagcaaacg 


taacgaaaaa 


2220 


tgggatgaag 


tttacaaata 


tatcgtgacc 


aactggctgg 


ctaaggttaa 


tactcagatc 


2280 


gacctcatcc 


gcaaaaaaat 


gaaagaagca 


ctggaaaacc 


aggcggaagc 


taccaaggca 


2340 


atcattaact 


accagtacaa 


ccagtacacc 


gaggaagaaa 


aaaacaacat 


caacttcaac 


2400 


atcgacgatc 


tgtcctctaa 


actgaacgaa 


tccatcaaca 


aagctatgat 


caacatcaac 


2460 


aagttcctga 


accagtgctc 


tgtaagctat 


ctgatgaact 


ccatgatccc 


gtacggtgtt 


2520 


aaacgtctgg 


aggacttcga 


tgcgtctctg 


aaagacgccc 


tgctgaaata 


catttacgac 


2580 


aaccgtggca 


ctctgatcgg 


tcaggttgat 


cgtctgaagg acaaagtgaa caatacctta 


2640 


tcgaccgaca 


tcccttttca 


gctcagtaaa 


tatgtcgata accaacgcct tttgtccact 


2700 


ctagactag 












2709 



<210> 73 

<211> 902 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 73 



Gly Ser Met Glu Phe Val Asn ~Lys Gin Phe- Asn 'Tyr -Lys* Asp Prfc Val 
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10 15 



Asn Gly Val Asp lie Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Tip Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 • 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 

* ' ■* 245 - * - 250 v> 255 
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Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val 
370 375 380 



Asn Tyr Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly He He Thr Ser Lys Thr Lys Ser Leu He Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 " 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val Asn Asn Trp Asp 

485..^ . 490 . . 495 
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Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn 
515 520 525 



lie Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn lie Ser lie Glu Asn Leu Ser Ser Asp lie lie 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn lie Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg lie Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys lie Ala Asp lie Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly Ala Leu He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala lie Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser Lys 
725 73.0 735 
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Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp 
740 745 750 



Leu Ala Lys Val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys 
755 760 ~ 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn 
785 790 795 800 



He Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met 
805 810 815 



He Asn He Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu He Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Asp 
900 



<210> 74 
<211> 2889 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 74 

ggatccttgg tacgagatga cgttgactat caaattttcc gcgactttgc ggaaaataaa 60 
ggtaagtttt tcgtcggcgc cacagacctg tccgtcaaaa ataagagagg ccagaacatc 120 
ggtaacgcac tgagcaacgt ccctatgatt gattttagtg tagcggacgt taataaacgg 180 
attgcaaccg tcgttgatcc gcagtatgct gtcagcgtca aacatgctaa agcggaagtt 240 
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catacgttct 


attacgggca 


atataacggc 


cataacgatg 


tggctgataa 


agaaaatgaa 


300 


tatcgcgtgg 


tcgagcagaa 


caattacgaa 


ccgcacaaag 


cgtggggcgc 


gagtaattta 


360 


ggccgcctgg 


aggactataa 


catggcccgt 


ttcaataaat 


tcgtgaccga 


ggtagcaccg 


420 


atcgccccca 


cagatgctgg 


tgggggcctg 


gatacctaca 


aagataaaaa 


ccgcttctct 


480 


agcttcgtgc 


gcattggcgc 


cggtcgtcag 


ctcgtgtacg 


agaagggtgt 


ctatcaccag 


540 


gaaggtaatg 


aaaaggggta 


cgacctccgt 


gatttgtccc 


aggcgtatcg 


ctacgctatt 


600 


gccggaaccc 


cgtataaaga 


tattaatatc 


gatcaaacca 


tgaataccga 


aggcctaatt 


660 


ggtttcggga 


atcataataa 


gcaatatagc 


gcagaagagc 


taaagcaggc 


cctcagccaa 


720 


gatgcgttaa 


ccaattacgg 


agtgttaggc 


gatagcggca 


gtccgctgtt 


tgccttcgat 


780 


aaacagaaaa 


atcaatgggt 


gtttctgggc 


acttatgatt 


attgggccgg 


atatggtaaa 


840 


aagagctggc 


aggaatggaa 


tatttataaa 


aaggaattcg 


cagacaaaat 


caagcagcat 


900 


gacaacgcag 


gtacggtgaa 


ggggaacggc 


gaacatcact 


ggaagacgac 


cggcacgaat 


960 


agtcatatcg 


gatcgacggc 


cgttcgcctg 


gcgaacaatg 


agggcgatgc 


aaacaatggg 


1020 


caaaacgtga 


cctttgagga 


caacggtacc 


ctggtcctta 


accagaacat 


aaatcagggc 


1080 


gcgggaggct 


tgttctttaa 


aggcgactat 


actgttaagg 


gagcaaacaa 


tgacatcacc 


1140 


tggttagggg 


ccggtattga 


cgttgcggat 


ggaaaaaagg 


tggtttggca 


ggttaaaaac 


1200 


cctaacgggg 


accggctggc 


aaaaatcggc 


aaagggacat 


tggaaattaa 


tggtaccggt 


1260 


gtgaatcagg 


gtcagctgaa 


agtgggagat 


gggaccgtga 


ttctgaacca 


gaaagcagac 


1320 


gctgacaaaa 


aggtgcaagc 


ctttagccaa 


gtaggaattg 


ttagtggtcg 


tggcacactc 


1380 


gtcttgaact 


caagcaacca 


aataaatccg 


gataacctgt 


actttggatt 


tcgtggcgga 


1440 


cgcctggatg 


ctaacgggaa 


tgatctgacc 


tttgaacata 


tccgtaacgt 


tgacgagggt 


1500 


gcgcgcatag 


ttaatcataa 


tactgaccat 


gcatcaacta 


tcaccttgac 


cgggaaaagt 


1560 


ctgattacaa 


acccaaactc 


tctgtcagta 


cattccatcc 


agaatgatta 


tgatgaagac 


1620 


gattactcat 


actattaccg 


gccgcgtaga 


ccaattccac 


aaggtaaaga 


tctttattac 


1680 


aaaaattacc 


gttattacgc 


attaaaatcc 


ggagggcggc 


tgaatgcacc 


tatgccggaa 


1740 


aatggcgtgg 


ccgaaaacaa 


tgactggatt 


tttatgggtt 


atactcaaga 


agaggctcgc 


1800 


aaaaatgcaa 


tgaaccataa 


aaataaccga 


aggatcggtg 


atttcggcgg 


atttttcgat 


1860 


gaggaaaatg 


gtaaaggtca 


caatggtgcg 


ctgaatctaa 


attttaacgg 


caaaagtgcc 


1920 


^ayaaaCy ll 


^ s*i rt 4- ^ ^ t-t ~\ ft 

L.CCU L.C LydC 


tggtggcgcu 


a a t* c t~ a a a t* a 


y nana i— o ciy 


uy Ly auy Lay 




ggtaacgtgc 


tgctttctgg 


ccggccaact 


ccgcatgcac 


gtgattttgt 


aaataaatcg 


2040 


agcgctcgta 


aagatgcgca 


tttttctaaa 


aataacgagg 


tcgtgtttga 


agatgactgg 


2100 
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ataaatcgca 


cctttaaagc 


ggcagaaatc gcggttaatc agagtgcgag cttttcatcg 


2160 


ggtaggaatg 


tatctgatat 


tacagcaaac attacagcca ctgataatgc gaaggtcaac 


2220 


ctgggttata 


aaaacggtga 


tgaagtttgt gttcgatcgg attacacggg ctatgttacc 


2280 


tgcaacactg 


gcaatctgtc 


tgataaagcg cttaactctt ttgacgccac gcgcattaac 


2340 


gggaatgtga 


acctgaacca 


aaacgctgcc ttggtacttg gtaaggccgc gttgtggggt 


2400 


aaaattcagg 


gccagggcaa 


ctcccgtgtg tctctgaacc agcactcgaa gtggcacctg 


2460 


acgggggact 


cgcaggtgca 


caacttgtcc ctggccgata gccatattca ccttaacaat 


2520 


gcgtccgatg 


cccagtcagc 


taataaatat catacgatca aaatcaatca cctctctggc 


2580 


aacggtcact 


ttcactactt 


aacggattta gcaaaaaact taggggataa agtcctggta 


2640 


aaagaatcag 


cgagcggaca 


ttatcagtta catgtacaga acaaaacagg cgagccaaat 


2700 


caggaaggcc 


ttgacttatt 


tgatgcttca tcggtacaag atcgttccag actgttcgtt 


2760 


tcactcgcga 


atcactacgt 


tgatctgggt gcgctgcgct atactataaa gacggaaaat 


2820 


ggcataacac 


gcctctataa 


tccctatgcc ggtaacggcc gtccggtgaa acctgctccc 


2880 


tgcgtcgac 






2889 



<210> 75 

<211> 4296 

<212> DNA 

<213> Artificial Sequence 
<220>. 

<223> Synthetic 

<400> 75 



ggatccttgg 


tacgagatga 


cgttgactat caaattttcc gcgactttgc ggaaaataaa 


60 


ggtaagtttt 


tcgtcggcgc 


cacagacctg tccgtcaaaa ataagagagg ccagaacatc 


120 


ggtaacgcac 


tgagcaacgt 


ccctatgatt gattttagtg tagcggacgt taataaacgg 


180 


attgcaaccg 


tcgttgatcc 


gcagtatgct gtcagcgtca aacatgctaa agcggaagtt 


240 


catacgttct 


attacgggca 


atataacggc cataacgatg tggctgataa agaaaatgaa 


300 


tatcgcgtgg 


tcgagcagaa 


caattacgaa ccgcacaaag cgtggggcgc gagtaattta 


360 


ggccgcctgg 


aggactataa 


catggcccgt . ttcaataaat tcgtgaccga ggtagcaccg 


420 


atcgccccca 


cagatgctgg 


tgggggcctg gatacctaca aagataaaaa ccgcttctct 


480 


agcttcgtgc 


gcattggcgc 


cggtcgtcag ctcgtgtacg agaagggtgt ctatcaccag 


540 


gaaggtaatg 


aaaaggggta 


cgacctccgt gatttgtccc aggcgtatcg ctacgctatt 


600 


gccggaaccc 


cgtataaaga 


tattaatatc gatcaaacca tgaataccga aggcctaatt 


660 
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ggtttcggga 


atcataataa 


gcaatatagc 


gcagaagagc 


taaagcaggc 


cctcagccaa 


720 


gatgcgttaa 


ccaattacgg 


agtgttaggc 


gatagcggca 


gtccgctgtt 


tgccttcgat 


780 


aaacagaaaa 


atcaatgggt 


gtttctgggc 


acttatgatt 


attgggccgg 


atatggtaaa 


840 


aagagctggc 


aggaatggaa 


tatttataaa 


aaggaattcg 


cagacaaaat 


caagcagcat 


900 


gacaacgcag 


gtacggtgaa 


ggggaacggc 


gaacatcact 


ggaagacgac 


cggcacgaat 


960 


agtcatatcg 


gatcgacggc 


cgttcgcctg 


gcgaacaatg 


agggcgatgc 


aaacaatggg 


1020 


caaaacgtga 


cctttgagga 


caacggtacc 


ctggtcctta 


accagaacat 


aaatcagggc 


1080 


gcgggaggct 


tgttctttaa 


aggcgactat 


actgttaagg 


gagcaaacaa 


tgacatcacc 


1140 


tggttagggg 


ccggtattga 


cgttgcggat 


ggaaaaaagg 


tggtttggca 


ggttaaaaac 


1200 


cctaacgggg 


accggctggc 


aaaaatcggc 


aaagggacat 


tggaaattaa 


tggtaccggt 


1260 


gtgaatcagg 


gtcagctgaa 


agtgggagat 


gggaccgtga 


ttctgaacca 


gaaagcagac 


1320 


gctgacaaaa 


aggtgcaagc 


ctttagccaa 


gtaggaattg 


ttagtggtcg 


tggcacactc 


1380 


gtcttgaact 


caagcaacca 


aataaatccg 


gataacctgt 


actttggatt 


tcgtggcgga 


1440 


cgcctggatg 


ctaacgggaa 


tgatctgacc 


tttgaacata 


tccgtaacgt 


tgacgagggt 


1500 


gcgcgcatag 


ttaatcataa 


tactgaccat 


gcatcaacta 


tcaccttgac 


cgggaaaagt 


1560 


ctgattacaa 


acccaaactc 


tctgtcagta 


cattccatcc 


agaatgatta 


tgatgaagac 


1620 


gattactcat 


actattaccg 


gccgcgtaga 


ccaattccac 


aaggtaaaga 


tctttattac 


1680 


aaaaattacc 


gttattacgc 


attaaaatcc 


ggagggcggc 


tgaatgcacc 


tatgccggaa 


1740 


aatggcgtgg 


ccgaaaacaa 


tgactggatt 


tttatgggtt 


atactcaaga 


agaggctcgc 


1800 


aaaaatgcaa 


tgaaccataa 


aaataaccga 


aggatcggtg 


atttcggcgg 


atttttcgat 


1860 


gaggaaaatg 


gtaaaggtca 


caatggtgcg 


ctgaatctaa 


attttaacgg 


caaaagtgcc 


1920 


cagaaacgtt 


tccttctgac 


tggtggcgct 


aatctgaatg 


gtaaaatcag 


tgtgacgcag 


1980 


ggtaacgtgc 


tgctttctgg 


ccggccaact 


ccgcatgcac 


gtgattttgt 


aaataaatcg 


2040 


agcgctcgta 


aagatgcgca 


tttttctaaa 


aataacgagg 


tcgtgtttga 


agatgactgg 


2100 


ataaatcgca 


cctttaaagc 


ggcagaaatc 


gcggttaatc 


agagtgcgag 


cttttcatcg 


2160 


ggtaggaatg 


tatctgatat 


tacagcaaac 


attacagcca 


ctgataatgc 


gaaggtcaac 


2220 


ctgggttata 


aaaacggtga 


tgaagtttgt 


gttcgatcgg 


attacacggg 


ctatgttacc 


2280 


tgcaacactg 


gcaatctgtc 


tgataaagcg 


cttaactctt 


ttgacgccac 


gcgcattaac 


2340 


gggaaugtga 


acctgaacca 


aaacgctgcc 


ttggtacttg 


gtaaggccgc 


gttgtggggt 


2400 


aaaattcagg 


gccagggcaa 


ctcccgtgtg 


tctctgaacc 


agcactcgaa 


gtggcacctg 


2460 


acgggggact 


cgcaggtgca 


caacttgtcc 


ctgg-ccgata Ogccatattca 


ccttaacaat 


2520 
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gcgtccgatg 


cccagtcagc 


taataaatat 


catacgatca 


aaatcaatca 


cctctctggc 


2580 


aacggtcact 


ttcactactt 


aacggattta gcaaaaaact 


taggggataa 


agtcctggta 


2640 


aaagaatcag 


cgagcggaca 


tLOLCoytUa 


cacguacaga 


acaaaacagg 


egagecaaat 


2700 


caggaaggcc 


ttgacttatt 


ugatgcu tea 


ccgguacaag 


atcgttccag 


actgttcgtt 


2760 


tcactcgcga 


atcactacgt 


tgatctgggt 


gcgctgcgct 


atactataaa 


gaeggaaaat 


2820 


ggcataacac 


gcctctataa 


tccctatgcc 


ggtaaeggee 


gtccggtgaa 


acctgctccc 


2880 


tgcgtcgacg 


gcatcattac 


ctccaaaact 


aaatctctga 


tagaaggtag 


atttggcggt 


2940 


ttcacgggcg 


cacgcaaatc 


agegegtaaa 


egtaagaace 


aggegctage 


gggcggtggc 


3000 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagegcac 


tagtgetgea 


gtgtatcaag 


3060 


gttaacaact 


gggatttatt 


cttcagcccg agtgaagaca 


acttcaccaa 


cgacctgaac 


3120 


aaaggtgaag 


aaatcacctc 


agatactaac 


atcgaagcag 


ccgaagaaaa 


catctcgctg 


3180 


gacctgatcc 


agcagtacta 


cctgaccttt 


aatttcgaca 


aegagcegga 


aaacatttct 


3240 


atcgaaaacc 


tgagctctga 


tatcategge 


cagctggaac 


tgatgccgaa 


catcgaacgt 


3300 


ttcccaaacg 


gtaaaaagta 


cgagctggac 


aaatatacca 


tgttccacta 


cctgcgcgcg 


3360 


caggaatttg 


aacacggcaa 


atcccgtatc 


gcactgacta 


actccgttaa 


cgaagctctg 


3420 


ctcaacccgt 


cccgtgtata 


caccttcttc 


tetagegact 


acgtgaaaaa 


ggtcaacaaa 


3480 


gcgactgaag 


ctgcaatgtt 


cttgggttgg gttgaacagc 


ttgtttatga 


ttttaccgac 


3540 


gagacgtccg 


aagtatctac 


taccgacaaa attgeggata 


tcactatcat 


catcccgtac 


3600 


atcggtccgg 


ctctgaacat 


tggcaacatg 


ctgtacaaag 


aegacttegt 


tggcgcactg 


3660 


atcttctccg 


gtgcggtgat 


ectgetggag 


ttcatcccgg 


aaatcgecat 


cccggtactg 


3720 


ggcacctttg 


ctctggtttc 


ttacattgea 


aacaaggttc 


tgactgtaca 


aaccatcgac 


3780 


aacgcgctga 


gcaaacgtaa 


cgaaaaatgg gatgaagttt 


acaaatatat 


cgtgaccaac 


3840 


tggctggcta 


aggttaatac 


tcagatcgac 


ctcatccgca 


aaaaaatgaa 


agaagcactg 


3900 


gaaaaccagg 


cggaagctac 


caaggcaatc 


attaactacc 


agtacaacca 


gtacaccgag 


3960 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt 


cctctaaact 


gaacgaatcc 


4020 


atcaacaaag 


ctatgatcaa 


catcaacaag ttcctgaacc 


agtgctctgt 


aagctatctg 


4080 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg 


acttcgatgc 


gtctctgaaa 


4140 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc 


tgateggtea 


ggttgatcgt 


4200 


ctgaaggaca 


aagtgaacaa 


taccttatcg 


accgacatcc 


cttttcagct 


cagtaaatat 


4260 


gtcgataacc 


aacgcctttt 


gtccactcta 


gactag 






4296 
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<Z1U> 


/ o 


<211> 


1431 


<212> 


PRT 


<Z ± J > 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


76 



Gly Ser Leu Val Arg Asp Asp Val Asp Tyr Gin He Phe Arg Asp Phe 
15 10 15 



Ala Glu Asn Lys Gly Lys Phe Phe Val Gly Ala Thr Asp Leu Ser Val 
20 2.5 ^ 30 



Lys Asn Lys Arg Gly Gin Asn He Gly Asn Ala Leu Ser Asn Val Pro 
35 40 45 



Met He Asp Phe Ser Val Ala Asp Val Asn Lys Arg He Ala Thr Val 
50 55 60 



Val Asp Pro Gin Tyr Ala Val Ser Val Lys His Ala Lys Ala Glu Val 
65 70 75 80 



His Thr Phe Tyr Tyr Gly Gin Tyr Asn Gly His Asn Asp Val Ala Asp 
85 90 95 



Lys Glu Asn Glu Tyr Arg Val Val Glu Gin Asn Asn Tyr Glu Pro His 
100 105 110 



Lys Ala Trp Gly Ala Ser Asn Leu Gly Arg Leu Glu Asp Tyr Asn Met 
115 120 125 



Ala Arg Phe Asn Lys Phe Val Thr Glu Val Ala Pro He Ala Pro Thr 
130 135 140 



Asp Ala Gly Gly Gly Leu Asp Thr Tyr Lys Asp Lys Asn Arg Phe Ser 
145 150 155 160 



Ser Phe Val Arg He Gly Ala Gly Arg Gin Leu Val Tyr Glu Lys Gly 
165 170 ~ 175 



Val Tyr His Gin Glu Gly Asn Glu Lys Gly Tyr Asp Leu Arg Asp Leu 
180 185 190 



Ser Gin Ala Tyr Arg Tyr Ala He Ala Gly Thr Pro Tyr Lys Asp He 

195- : . * vi-' « 200> . - 205 
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Asn lie Asp Gin Thr Met Asn Thr Glu Gly Leu lie Gly Phe Gly Asn 
210 215 ~ 220 



His Asn Lys Gin Tyr Ser Ala Glu Glu Leu Lys Gin Ala Leu Ser Gin 
225 230 235 240 



Asp Ala Leu Thr Asn Tyr Gly Val Leu Gly Asp Ser Gly Ser Pro Leu 
245 250 A 255 



Phe Ala Phe Asp Lys Gin Lys Asn Gin Trp Val Phe Leu Gly Thr Tyr 
260 265 270 



Asp Tyr Trp Ala Gly Tyr Gly Lys Lys Ser Trp Gin Glu Trp Asn He 
275 280 285 



Tyr Lys Lys Glu Phe Ala Asp Lys He Lys Gin His Asp Asn Ala Gly 
290 295 300 



Thr Val Lys Gly Asn Gly Glu His His Trp Lys Thr Thr Gly Thr Asn 
305 310 315 320 



Ser His He Gly Ser Thr Ala Val Arg Leu Ala Asn Asn Glu Gly Asp 
325 330 335 



Ala Asn Asn Gly Gin Asn Val Thr Phe Glu Asp Asn Gly Thr Leu Val 
340 345 350 



Leu Asn Gin Asn He Asn Gin Gly Ala Gly Gly Leu Phe Phe Lys Gly 
355 360 365 



Asp Tyr Thr Val Lys Gly Ala Asn Asn Asp He Thr Trp Leu Gly Ala 
370 375 380 



Gly He Asp Val Ala Asp Gly Lys Lys Val Val Trp Gin Val Lys Asn 
385 ' 390 395 ~ 400 



Pro Asn Gly Asp Arg Leu Ala Lys He Gly Lys Gly Thr Leu Glu He 
405 410 415 



Asn Gly Thr Gly Val Asn Gin Gly Gin Leu Lys Val Gly Asp Gly Thr 
420 425 430 



Val He Leu Asn Gin Lys Ala Asp Ala Asp Lys Lys Val Gin Ala Phe 
435 440 445 
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Ser Gin Val Gly lie Val Ser Gly Arg Gly Thr Leu Val Leu Asn Ser 
450 455 460 



Ser Asn Gin lie Asn Pro Asp Asn Leu Tyr Phe Gly Phe Arg Gly Gly 
465 470 475 480 



Arg Leu Asp Ala Asn Gly Asn Asp Leu Thr Phe Glu His lie Arg Asn 
485 * 490 495 



Val Asp Glu Gly Ala Arg lie Val Asn His Asn Thr Asp His Ala Ser 
500 505 510 



Thr lie Thr Leu Thr Gly Lys Ser Leu lie Thr Asn Pro Asn Ser Leu 
515 520 525 



Ser Val His Ser lie Gin Asn Asp Tyr Asp Glu Asp Asp Tyr Ser Tyr 
530 535 540 



Tyr Tyr Arg Pro Arg Arg Pro lie Pro Gin Gly Lys Asp Leu Tyr Tyr 
545 550 555 560 



Lys Asn Tyr Arg Tyr Tyr Ala Leu Lys Ser Gly Gly Arg Leu Asn Ala 
565 570 575 



Pro Met Pro Glu Asn Gly Val Ala Glu Asn Asn Asp Trp lie Phe Met 
580 585 590 



Gly Tyr Thr Gin Glu Glu Ala Arg Lys Asn Ala Met Asn His Lys Asn 
595 600 605 



Asn Arg Arg lie Gly Asp Phe Gly Gly Phe Phe Asp Glu Glu Asn Gly 
610 615 620 



Lys Gly His Asn Gly Ala Leu Asn Leu Asn Phe Asn Gly Lys Ser Ala 
625 630 635 640 



Gin Lys Arg Phe Leu Leu Thr Gly Gly Ala Asn Leu Asn Gly Lys lie 
645 650 655 



Ser Val Thr Gin Gly Asn Val Leu Leu Ser Gly Arg Pro Thr Pro His 
660 665 670 



Ala Arg Asp Phe Val Asn Lys Ser Ser Ala Arg Lys Asp Ala His Phe 
675 680 685 
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Ser Lys Asn Asn Glu Val Val Phe Glu Asp Asp Trp lie Asn Arg Thr 
690 695 700 



Phe Lys Ala Ala Glu lie Ala Val Asn Gin Ser Ala Ser Phe Ser Ser 
705 710 715 720 



Gly Arg Asn Val Ser Asp lie Thr Ala Asn lie Thr Ala Thr Asp Asn 
725 730 735 



Ala Lys Val Asn Leu Gly Tyr Lys Asn Gly Asp Glu Val Cys Val Arg 
740 " ' 745 750 



Ser Asp Tyr Thr Gly Tyr Val Thr Cys Asn Thr Gly Asn Leu Ser Asp 
755 760 765 



Lys Ala Leu Asn Ser Phe Asp Ala Thr Arg lie Asn Gly Asn Val Asn 
770 775 780 



Leu Asn Gin Asn Ala Ala Leu Val Leu Gly Lys Ala Ala Leu Trp Gly 
785 790 795 800 



Lys lie Gin Gly Gin Gly Asn Ser Arg Val Ser Leu Asn Gin His Ser 
805 810 815 



Lys Trp His Leu Thr Gly Asp Ser Gin Val His Asn Leu Ser Leu Ala 
820 825 830 



Asp Ser His lie His Leu Asn Asn Ala Ser Asp Ala Gin Ser Ala Asn 
835 840 845 



Lys Tyr His Thr lie Lys lie Asn His Leu Ser Gly Asn Gly His Phe 
850 855 860 



His Tyr Leu Thr Asp Leu Ala Lys Asn Leu Gly Asp Lys Val Leu Val 
865 870 875 880 



Lys Glu Ser Ala Ser Gly His Tyr Gin Leu His Val Gin Asn Lys Thr 
885 " 890 895 



Gly Glu Pro Asn Gin. Glu Gly Leu Asp Leu Phe Asp Ala Ser Ser Val 
900 905 910 



Gin Asp Arg Ser Arg Leu Phe Val Ser Leu Ala Asn His Tyr Val Asp 
915 920 925 



Leu Gly Ala Leu Arg Tyr Thr He Lys Thr Glu Asn Gly He Thr Arg 
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930 935 940 



Leu Tyr Asn Pro Tyr Ala Gly Asn Gly Arg Pro Val Lys Pro Ala Pro 
945 950 955 960 



Cys Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly 
965 970 975 



Arg Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys 
980 985 990 



Asn Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
995 1000 " 1005 



Gly Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val Asn Asn 
1010 1015 1020 



Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp 
1025 1030 1035 



Leu Asn Lys Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala 
1040 1045 1050 



Ala Glu Glu Asn lie Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu 
1055 1060 1065 



Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn lie Ser lie Glu Asn 
1070 1075 1080 



Leu Ser Ser Asp lie lie Gly Gin Leu Glu Leu Met Pro Asn lie 
1085 1090 1095 



Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr 
1100 1105 1110 



Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
1115 1120 1125 



Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro 
1130 1135 1140 



Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val 
1145 1150 1155 



Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin 
1160 11-65 '> " n -1170 
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Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
1175 1180 1185 



Asp Lys lie Ala Asp lie Thr lie lie lie Pro Tyr lie Gly Pro 
1190 1195 1200 



Ala Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly 
1205 1210 1215 



Ala Leu lie Phe Ser Gly Ala Val lie Leu Leu Glu Phe lie Pro 
1220 1225 1230 



Glu lie Ala lie Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr 
1235 1240 1245 



lie Ala Asn Lys Val Leu Thr Val Gin Thr lie Asp Asn Ala Leu 
1250 1255 1260 



Ser Lys Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr lie Val 
1265 1270 1275 



Thr Asn Trp Leu Ala Lys Val Asn Thr Gin lie Asp Leu lie Arg 
1280 1285 1290 



Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys 
1295 1300 1305 



Ala lie lie Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys 
1310 1315 1320 



Asn Asn lie Asn Phe Asn lie Asp Asp Leu Ser Ser Lys Leu Asn 
1325 1330 1335 



Glu Ser lie- Asn Lys Ala Met lie. Asn lie Asn Lys Phe Leu Asn 
1340 1345 1350 



Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met lie Pro Tyr Gly 
1355 1360 1365 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu 
1370 * 1375 1380 



Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr Leu lie Gly Gin Val 
1385 . ■ . . _ .1390 . 1395 
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Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp He 
1400 1405 1410 

Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
1415 1420 1425 

Thr Leu Asp 
1430 

<210> 77 

<211> 1357 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 77 
gctagcgggc 


ggtggcggta gcggcggtgg 


cggtagcggc ggtggcggta gcgcactagt 


60 


gctgcagtgt 


atcaaggtta acaactggga 


tttattcttc agcccgagtg aagacaactt 


120 


caccaacgac 


ctgaacaaag gtgaagaaat 


cacctcagat actaacatcg aagcagccga 


180 


agaaaacatc 


tcgctggacc tgatccagca 


gtactacctg acctttaatt tcgacaacga 


240 


gccggaaaac 


atttctatcg aaaacctgag 


ctctgatatc atcggccagc tggaactgat 


300 


gccgaacatc 


gaacgtttcc caaacggtaa 


aaagtacgag ctggacaaat ataccatgtt 


360 


ccactacctg 


cgcgcgcagg aatttgaaca 


cggcaaatcc cgtatcgcac tgactaactc 


420 


cgttaacgaa 


gctctgctca acccgtcccg 


tgtatacacc ttcttctcta gcgactacgt 


480 


gaaaaaggtc 


aacaaagcga ctgaagctgc 


aatgttcttg ggttgggttg aacagcttgt 


540 


ttatgatttt 


accgacgaga cgtccgaagt 


atctactacc gacaaaattg cggatatcac 


600 


tatcatcatc 


ccgtacatcg gtccggctct 


gaacattggc aacatgctgt acaaagacga 


660 


cttcgttggc 


gcactgatct tctccggtgc 


ggtgatcctg ctggagttca tcccggaaat 


720 


cgccatcccg 


gtactgggca cctttgctct 


ggtttcttac attgcaaaca aggttctgac 


780 


tgtacaaacc 


atcgacaacg cgctgagcaa 


acgtaacgaa aaatgggatg aagtttacaa 


840 


atatatcgtg 


accaactggc tggctaaggt 


taatactcag atcgacctca tccgcaaaaa 


900 


aatgaaagaa 


gcactggaaa accaggcgga 


agctaccaag gcaatcatta actaccagta 


960 


caaccagtac 


accgaggaag aaaaaaacaa 


catcaacttc aacatcgacg atctgtcctc 


1020 


taaactgaac 


gaatccatca acaaagctat 


gatcaacatc aacaagttcc tgaaccagtg 


1080 


ctctgtaagc 


tatctgatga actccatgat 


cccgtacggt gttaaacgtc tggaggactt 


1140 


-cgatgcgtct 


ctgaaagacg ;ccctgctgaa 


atacatttac gacaaccgtg gcactctgat 


1200 
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l. y y u. c a.y y u u 




«yy«co.a.cty t. 




CILaLUtLL. U U 


1 960 

JL D \J 


tcagctcagt 


aaatatgtcg 


ataaccaacg 


ccttttgtcc actctagaaa 


tagaaggtag 


1320 


aagtgggcac 


catcaccatc 


accattaatg 


aaagctt 




1357 


<210> 78 
<211> 2745 
<212> DNA 

<213> Artificial Sequence 








<220> 

<223> Synthetic 










<400> 78 
ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag acccagttaa 


cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc cggtaaaggc 


attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta ctaacccgga 


agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt actatgactc 


cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta ctaaactgtt 


cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg ttcgcggtat 


cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca ctaactgcat 


caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg tgatcatcgg 


cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag ttctgaacct 


cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta aattcgcaac 


tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc tgtacggtat 


cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta aattcatcga ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag atatcgcatc 


caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt acatgaagaa 


cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct ctgtagacaa gttgaaattc 


1020 


gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


.ttacaagctg 


ctgtgcgtcg acggcatcat 


tacctccaaa 


1320 
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actaaatctc tgatagaagg tagatttggc ggtttcacgg gcgcacgcaa atcagcgcgt 1380 

aaacgtaaga accaggcgct agcgggcggt ggcggtagcg gcggtggcgg tagcggcggt 1440 

ggcggtagcg cactagtgct gcagtgtatc aaggttaaca actgggattt attcttcagc 1500 

ccgagtgaag acaacttcac caacgacctg aacaaaggtg aagaaatcac ctcagatact 1560 

aacatcgaag cagccgaaga aaacatctcg ctggacctga tccagcagta ctacctgacc 1620 

tttaatttcg acaacgagcc ggaaaacatt tctatcgaaa acctgagctc tgatatcatc 1680 

ggccagctgg aactgatgcc gaacatcgaa cgtttcccaa acggtaaaaa gtacgagctg 1740 

gacaaatata ccatgttcca ctacctgcgc gcgcaggaat ttgaacacgg caaatcccgt 1800 

atcgcactga ctaactccgt taacgaagct ctgctcaacc cgtcccgtgt atacaccttc 1860 

ttctctagcg actacgtgaa aaaggtcaac aaagcgactg aagctgcaat gttcttgggt 1920 

tgggttgaac agcttgttta tgattttacc gacgagacgt ccgaagtatc tactaccgac 1980 

aaaattgcgg atatcactat catcatcccg tacatcggtc cggctctgaa cattggcaac 2040 

atgctgtaca aagacgactt cgttggcgca ctgatcttct ccggtgcggt gatcctgctg 2100 

gagttcatcc cggaaatcgc catcccggta ctgggcacct ttgctctggt ttcttacatt 2160 

gcaaacaagg ttctgactgt acaaaccatc gacaacgcgc tgagcaaacg taacgaaaaa 2220 

tgggatgaag tttacaaata tatcgtgacc aactggctgg ctaaggttaa tactcagatc 2280 

gacctcatcc gcaaaaaaat gaaagaagca ctggaaaacc aggcggaagc taccaaggca 2340 

atcattaact accagtacaa ccagtacacc gaggaagaaa aaaacaacat caacttcaac 2400 

atcgacgatc tgtcctctaa actgaacgaa tccatcaaca aagctatgat caacatcaac 2460 

aagttcctga accagtgctc tgtaagctat ctgatgaact ccatgatccc gtacggtgtt 2520 

aaacgtctgg aggacttcga tgcgtctctg aaagacgccc tgctgaaata catttacgac 2580 

aaccgtggca ctctgatcgg tcaggttgat cgtctgaagg acaaagtgaa caatacctta 2640 

tcgaccgaca tcccttttca gctcagtaaa tatgtcgata accaacgcct tttgtccact 2700 

ctagaaatag aaggtagaag tgggcaccat caccatcacc attaa 2745 

<210> 79 
<211> 914 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 79 

Gly Ser..Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lya Asp Pro Val 

1 : *- 5 . • r . r io j <• 15 
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Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 



Gin Pro Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro 
35 40 45 



Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro Glu Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu 
65 70 75 80 



Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



He Val Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu 
115 120 125 



Leu Lys Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala 
145 150 155 160 



Asp He He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 • 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 

245 ... „ 250 255 
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Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp 
260 265 270 



Ala Lys Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



lie Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 
305 310 315 320 



Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys lie Lys Val Asn Asn Trp Asp 
485 490 495 
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Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys 
500 505 510 



Gly Glu Glu lie Thr Ser Asp Thr Asn lie Glu Ala Ala Glu Glu Asn 
515 520 525 



lie Ser Leu Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp 
530 535 540 



Asn Glu Pro Glu Asn lie Ser He Glu Asn Leu Ser Ser Asp lie He 
545 550 555 560 



Gly Gin Leu Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys 
565 570 575 



Lys Tyr Glu Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin 
580 585 590 



Glu Phe Glu His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn 
595 600 605 



Glu Ala Leu Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp 
610 615 620 



Tyr Val Lys Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly 
625 630 635 640 



Trp Val Glu Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val 
645 650 655 



Ser Thr Thr Asp Lys He Ala Asp He Thr He He He Pro Tyr He 
660 665 670 



Gly Pro Ala Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val 
675 680 685 



Gly. Ala Leu lie Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro 
690 695 700 



Glu He Ala He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He 
705 710 715 720 



Ala Asn Lys Val Leu Thr Val Gin Thr lie Asp Asn Ala Leu Ser Lys 
725 730 735 



Arg Asn Glu Lys Trp Asp Glu Val Tyr Lys Tyr lie Val Thr Asn Trp 
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740 745 750 



Leu Ala Lys Val Asn Thr Gin lie Asp Leu lie Arg Lys Lys Met Lys 
755 760 765 



Glu Ala Leu Glu Asn Gin Ala Glu Ala Thr Lys Ala lie lie Asn Tyr 
770 775 780 



Gin Tyr Asn Gin Tyr Thr Glu Glu Glu Lys Asn Asn lie Asn Phe Asn 
785 790 795 800 



lie Asp Asp Leu Ser Ser Lys Leu Asn Glu Ser lie Asn Lys Ala Met 
805 810 815 



lie Asn lie Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met 
820 825 830 



Asn Ser Met lie Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala 
835 840 845 



Ser Leu Lys Asp Ala Leu Leu Lys Tyr lie Tyr Asp Asn Arg Gly Thr 
850 855 860 



Leu lie Gly Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu 
865 870 875 880 



Ser Thr Asp lie Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg 
885 890 895 



Leu Leu Ser Thr Leu Glu lie Glu Gly Arg Ser Gly His His His His 
900 905 910 



His His 



<210> 


80 


<211> 


619 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


80 



gctagcgggc ggtggcggta gcggcggtgg cggtagcggc ggtggcggta gcgcactagt 60 
gctgcagtgt atcaatctgg attgggacgt aatccgtgat aagaccaaaa caaaaatcga 120 



gtctbtgaaa gaacacggcc cgatc'aaaaa taagatgtct gaatcaccca ataaaactgt 180 
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ttcggaggaa aaagcgaaac agtatttgga agagtttcat caaaccgcgc ttgaacatcc 240 

ggagctcagt gaactgaaaa cagtgacggg aacgaatcct gtttttgcag gcgcaaacta 3 00 

tgcggcttgg gccgtgaatg ttgcccaagt aattgatagt gagaccgcag acaacctgga 360 

aaagacgacc gcagcgttaa gcattttacc ggggattggt tccgtgatgg gtatagcgga 42 0 

tggagcggtc caccataaca ctgaggaaat tgtcgcccag tcaatcgctc tgagttccct 480 

gatggttgca caggctatcc cactcgtggg ggaactggtt gacataggtt tcgccgccta 540 

caacttcgta gaaagcatta ttaatctttt tcaggtggtg cataacagct acaaccgccc 600 

tctagaatga taaaagctt 619 

<210> 81 

<211> 1971 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 81 



ggatccatgg 


agttcgttaa 


caaacagttc 


aactataaag acccagttaa cggtgttgac 


60 


attgcttaca 


tcaaaatccc 


gaacgctggc 


cagatgcagc cggtaaaggc attcaaaatc 


120 


cacaacaaaa 


tctgggttat 


cccggaacgt 


gataccttta ctaacccgga agaaggtgac 


180 


ctgaacccgc 


caccggaagc 


gaaacaggtg 


ccggtatctt actatgactc cacctacctg 


240 


tctaccgata 


acgaaaagga 


caactacctg 


aaaggtgtta ctaaactgtt cgagcgtatt 


300 


tactccaccg 


acctgggccg 


tatgctgctg 


actagcatcg ttcgcggtat cccgttctgg 


360 


ggcggttcta 


ccatcgatac 


cgaactgaaa 


gtaatcgaca ctaactgcat caacgttatt 


420 


cagccggacg 


gttcctatcg 


ttccgaagaa 


ctgaacctgg tgatcatcgg cccgtctgct 


480 


gatatcatcc 


agttcgagtg 


taagagcttt 


ggtcacgaag ttctgaacct cacccgtaac 


540 


ggctacggtt 


ccactcagta 


catccgtttc 


tctccggact tcaccttcgg ttttgaagaa 


600 


tccctggaag 


tagacacgaa 


cccactgctg 


ggcgctggta aattcgcaac tgatcctgcg 


660 


gttaccctgg 


ctcacgaact 


gattcatgca 


ggccaccgcc tgtacggtat cgccatcaat 


720 


ccgaaccgtg 


tcttcaaagt 


taacaccaac 


gcgtattacg agatgtccgg tctggaagtt 


780 


agcttcgaag 


aactgcgtac 


ttttggcggt 


cacgacgcta aattcatcga ctctctgcaa 


840 


gaaaacgagt 


tccgtctgta 


ctactataac 


aagttcaaag atatcgcatc caccctgaac 


900 


aaagcgaaat 


ccatcgtggg 


taccactgct 


tctctccagt acatgaagaa cgtttttaaa 


960 


gaaaaatacc 


tgctcagcga 


agacacctcc 


ggcaaattct ctgtagacaa gttgaaattc 


1020 
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gataaacttt 


acaaaatgct 


gactgaaatt 


tacaccgaag 


acaacttcgt 


taagttcttt 


1080 


aaagttctga 


accgcaaaac 


ctatctgaac 


ttcgacaagg 


cagtattcaa 


aatcaacatc 


1140 


gtgccgaaag 


ttaactacac 


tatctacgat 


ggtttcaacc 


tgcgtaacac 


caacctggct 


1200 


gctaatttta 


acggccagaa 


cacggaaatc 


aacaacatga 


acttcacaaa 


actgaaaaac 


1260 


ttcactggtc 


tgttcgagtt 


ttacaagctg 


ctgtgcgtcg 


acggcatcat 


tacctccaaa 


1320 


actaaatctc 


tgatagaagg 


tagatacggt 


ggtttcctgg 


cgctagcggg 


cggtggcggt 


1380 


agcggcggtg 


gcggtagcgg 


cggtggcggt 


agcgcactag 


tgctgcagtg 


tatcaatctg 


1440 


gattgggacg 


taatccgtga 


taagaccaaa 


acaaaaatcg 


agtctttgaa 


agaacacggc 


1500 


ccgatcaaaa 


ataagatgtc 


tgaatcaccc 


aataaaactg 


tttcggagga 


aaaagcgaaa 


1560 


cagtatttgg 


aagagtttca 


tcaaaccgcg 


cttgaacatc 


cggagctcag 


tgaactgaaa 


1620 


acagtgacgg 


gaacgaatcc 


tgtttttgca 


ggcgcaaact 


atgcggcttg 


ggccgtgaat 


1680 


gttgcccaag 


taattgatag 


tgagaccgca 


gacaacctgg 


aaaagacgac 


cgcagcgtta 


1740 


agcattttac 


cggggattgg 


ttccgtgatg 


ggtatagcgg 


atggagcggt 


ccaccataac 


1800 


at uy ayy dcici 




y uc^dci ucy C L. 


CLydy llccc 


l. ga u gg u t. g c 


ac aggc c at c 


lobU 


ccactcgtgg 


gggaactggt 


tgacataggt 


ttcgccgcct 


acaacttcgt 


agaaagcatt 


1920 


attaatcttt 


ttcaggtggt 


gcataacagc 


tacaaccgcc 


ctctagaatg 


a 


1971 



<210> 82 
<211> 656 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 82 ' 

Gly Ser Met Glu Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val 
15 .10 15 

Asn Gly Val Asp He Ala Tyr He Lys He Pro Asn Ala Gly Gin Met 
20 25 30 

Gin Pro Val Lys Ala Phe Lys He His Asn Lys lie Trp Val He Pro 
35 40 45 

Glu Arg Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro 
50 55 60 



Pro.. Glu. Ala Lys Gin Val Pro Val Ser Tyr JTyr 
' -65 — 70 75 



Asp. Ser Thr Tyr Leu 
80 
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Ser Thr Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 
85 90 95 



Phe Glu Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser 
100 105 110 



lie Val Arg Gly lie Pro Phe Trp Gly Gly Ser Thr lie Asp Thr Glu 
115 ' 120 125 



Leu Lys Val lie Asp Thr Asn Cys lie Asn Val lie Gin Pro Asp Gly 
130 135 140 



Ser Tyr Arg Ser Glu Glu Leu Asn Leu Val lie lie Gly Pro Ser Ala 
145 150 155 160 



Asp lie lie Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn 
165 170 175 



Leu Thr Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro 
180 185 190 



Asp Phe Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro 
195 200 205 



Leu Leu Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala 
210 215 220 



His Glu Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn 
225 230 235 240 



Pro Asn Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser 
' 245 250 255 



Gly Leu Glu Val Ser Phe Glu Glu Leu Arg Thr Phe" Gly Gly His Asp 
260 265 270 



Ala Lys Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr 
275 280 285 



Tyr Asn Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser 
290 295 300 



He Val Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys 

305 . 310 .._ _ .315 - . . 320 
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Glu Lys Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp 
325 330 ^ " 335 



Lys Leu Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu lie Tyr Thr 
340 345 350 



Glu Asp Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr 
355 360 365 



Leu Asn Phe Asp Lys Ala Val Phe Lys lie Asn lie Val Pro Lys Val 
370 375 380 



Asn Tyr Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala 
385 390 395 400 



Ala Asn Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr 
405 410 415 



Lys Leu Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys 
420 425 430 



Val Asp Gly lie lie Thr Ser Lys Thr Lys Ser Leu lie Glu Gly Arg 
435 440 445 



Tyr Gly Gly Phe Leu Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly 
450 455 460 



Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gin Cys lie Asn Leu 
465 470 475 480 



Asp Trp Asp Val He Arg Asp Lys Thr Lys Thr Lys He Glu Ser Leu 
485 490 495 



Lys Glu His Gly Pro He Lys Asn Lys Met Ser Glu Ser Pro Asn Lys 
500 505 510 



Thr Val Ser Glu Glu Lys Ala Lys Gin Tyr Leu Glu Glu Phe His Gin 
515 520 525 



Thr. Ala Leu Glu His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly 
530 535 540 



Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn 
545 550 555 560 
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Val Ala Gin Val lie Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr 
565 570 575 

Thr Ala Ala Leu Ser lie Leu Pro Gly lie Gly Ser Val Met Gly lie 
580 585 ~ 590 

Ala Asp Gly Ala Val His His Asn Thr Glu Glu He Val Ala Gin Ser 
595 600 605 

He Ala Leu Ser Ser Leu Met Val Ala Gin Ala He Pro Leu Val Gly 
610 615 620 

Glu Leu Val Asp He Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser He 
625 630 635 640 

He Asn Leu Phe Gin Val Val His Asn Ser Tyr Asn Arg Pro Leu Glu 



<210> 83 
<211> 1329 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 83 

ggatccatgc ctattactat taacaatttt cgttatagcg atcccgtcaa caatgacacc 60 

attatcatga tggaaccgcc atattgcaaa ggactggaca tttactataa agccttcaag 120 

attactgacc gcatttggat tgttccagag cgttacgagt tcgggacgaa accagaagat 180 

tttaacccgc cttcatcgct gatcgaagga gcatcagagt attacgatcc gaactatctg 240 

cgtacggaca gcgataaaga ccgcttctta cagaccatgg tcaaactttt taaccgtatt 300* 

aagaacaatg tggccggaga agcactcttg gataagatta tcaacgcgat tccatacctg 360 

ggcaattctt acagcctgct ggataaattt gacacaaata gtaattcagt cagctttaac 420 

ctgttagaac aagatccgag tggcgcaacc acgaagtctg ccatgctgac aaatctgatc 480 

atttttggtc caggtcctgt actgaataaa aatgaagtac gcggcatcgt tctccgcgtg 540 

gacaataaga actacttccc atgccgtgac ggcttcggtt cgatcatgca gatggctttc 600 

tgtccggagt acgttccgac gtttgataat gttattgaga atatcacgag tttaacaatc 660 

ggtaagtcaa aatattttca agatccggcc cttctcctta tgcatgaact gattcacgtg 720 

ctgcacggct tatatggtat gcaagtgtcc tcgcatgaaa tcattccgtc caaacaggaa 780 

atttatatgc agcataccta cccgatttca gctgaagagt tgtttacgtt tggtggccag 840 



WO 2006/059093 



147 



PCT/GB2005/004585 



gacgcgaatt tgatctccat cgacatcaaa aacgatctgt atgagaaaac attaaatgac 900 

tataaagcga ttgcgaacaa actgtctcag gtgactagct gcaacgatcc taacattgat 960 

attgattcct acaaacaaat ttatcaacag aaataccagt tcgataaaga cagcaatggt 1020 

cagtatatcg taaacgaaga taaatttcag atcctgtata acagcattat gtatggcttt 1080 

accgaaattg agttggggaa gaaatttaac attaaaaccc gtctgtctta ttttagtatg 1140 

aaccatgatc cggtgaaaat ccccaatctg cttgatgata ccatttataa tgataccgaa 1200 

gggttcaaca ttgaatctaa ggatctgaaa tccgaataca aaggccaaaa tatgcgtgtt 1260 

aatactaacg ctttccgtaa tgttgatggt agtggactcg tctcgaaact gattgggttg 1320 

tgtgtcgac 1329 

<210> 84 

<211> 2736 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 84 



ggatccatgc 


ctattactat 


taacaatttt 


cgttatagcg atcccgtcaa 


caatgacacc 


o u 


attatcatga 


tggaaccgcc 


atattgcaaa ggactggaca tttactataa 


agccttcaag 


120 


attactgacc 


gcatttggat 


tgttccagag 


cgttacgagt tcgggacgaa 


accagaagat 


180 


tttaacccgc 


cttcatcgct 


gatcgaagga 


gcatcagagt attacgatcc 


gaactatctg 


240 


cgtacggaca 


gcgataaaga 


ccgcttctta 


cagaccatgg tcaaactttt 


taaccgtatt 


300 


aagaacaatg 


tggccggaga 


agcactcttg 


gataagatta tcaacgcgat 


tccatacctg 


360 


ggcaattctt 


acagcctgct 


ggataaattt 


gacacaaata gtaattcagt 


cagctttaac 


420 


ctgttagaac 


aagatccgag 


tggcgcaacc 


acgaagtctg ccatgctgac 


aaatctgatc 


480 


atttttggtc 


caggtcctgt 


actgaataaa 


aatgaagtac gcggcatcgt 


tctccgcgtg 


540 


gacaataaga 


actacttccc 


atgccgtgac 


ggcttcggtt cgatcatgca gatggctttc 


600 


tgtccggagt 


acgttccgac 


gtttgataat 


gttattgaga atatcacgag 


tttaacaatc 


660 


ggtaagtcaa 


aatattttca 


agatccggcc 


cttctcctta tgcatgaact 


gattcacgtg 


720 


ctgcacggct 


tatatggtat 


gcaagtgtcc 


tcgcatgaaa tcattccgtc 


caaacaggaa 


780 


atttatatgc 


agcataccta 


cccgatttca gctgaagagt tgtttacgtt tggtggccag 


840 


gacgcgaatt 


tgatctccat 


cgacatcaaa 


aacgatctgt atgagaaaac 


attaaatgac 


900 


tataaagcga 


ttgcgaacaa 


actgtctcag gtgactagct gcaacgatcc 


taacattgat 


960 


attgattcct 


acaaacaaat 


ttatcaacag 


aaataccagt tcgataaaga 


cagcaatggt 


1020 
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cagtatatcg 


taaacgaaga 


taaatttcag 


atcctgtata 


acagcattat 


gtatggcttt 


1080 


accgaaattg 


agttggggaa 


gaaatttaac 


attaaaaccc 


gtctgtctta 


ttttagtatg 


1140 


aaccatgatc 


cggtgaaaat 


ccccaatctg 


cttgatgata 


ccatttataa 


tgataccgaa 


1200 


gggttcaaca 


ttgaatctaa 


ggatctgaaa 


tccgaataca 


aaggccaaaa 


tatgcgtgtt 


1260 


aatactaacg 


ctttccgtaa 


tgttgatggt 


agtggactcg 


tctcgaaact 


gattgggttg 


1320 


tgtgtcgacg 


gcatcattac 


ctccaaaact 


aaatctctga 


tagaaggtag 


atttggcggt 


1380 


ttcacgggcg 


cacgcaaatc 


agcgcgtaaa 


cgtaagaacc 


aggcgctagc 


gggcggtggc 


1440 


ggtagcggcg 


gtggcggtag 


cggcggtggc 


ggtagcgcac 


tagtgctgca 


gtgtatcaag 


1500 


gttaacaact 


gggatttatt 


cttcagcccg 


agtgaagaca 


acttcaccaa 


cgacctgaac 


1560 


aaaggtgaag 


aaatcacctc 


agatactaac 


atcgaagcag 


ccgaagaaaa 


catctcgctg 


1620 


gacctgatcc 


agcagtacta 


cctgaccttt 


aatttcgaca 


acgagccgga 


aaacatttct 


1680 


atcgaaaacc 


tgagctctga 


tatcatcggc 


cagctggaac 


tgatgccgaa 


catcgaacgt 


1740 


ttcccaaacg 


gtaaaaagta 


cgagctggac 


aaatatacca 


tgttccacta 


cctgcgcgcg 


1800 


caggaatttg 


aacacggcaa 


atcccgtatc 


gcactgacta 


actccgttaa 


cgaagctctg 


1860 


ctcaacccgt 


cccgtgtata 


caccttcttc 


tctagcgact 


acgtgaaaaa 


ggtcaacaaa 


1920 


gcgactgaag 


ctgcaatgtt 


cttgggttgg 


gttgaacagc 


ttgtttatga 


ttttaccgac 


1980 


gagacgtccg 


aagtatctac 


taccgacaaa 


attgcggata 


tcactatcat 


catcccgtac 


2040 


atcggtccgg 


ctctgaacat 


tggcaacatg 


ctgtacaaag 


acgacttcgt 


tggcgcactg 


2100 


atcttctccg 


gtgcggtgat 


cctgctggag 


ttcatcccgg 


aaatcgccat 


cccggtactg 


2160 


ggcacctttg 


ctctggtttc 


ttacattgca 


aacaaggttc 


tgactgtaca 


aaccatcgac 


2220 


aacgcgctga 


gcaaacgtaa 


cgaaaaatgg 


gatgaagttt 


acaaatatat 


cgtgaccaac 


2280 


tggctggcta 


aggttaatac 


tcagatcgac 


ctcatccgca 


aaaaaatgaa 


agaagcactg 


2340 


gaaaaccagg 


cggaagctac 


caaggcaatc 


attaactacc 


agtacaacca 


gtacaccgag 


2400 


gaagaaaaaa 


acaacatcaa 


cttcaacatc 


gacgatctgt 


cctctaaact 


gaacgaatcc 


2460 


atcaacaaag 


ctatgatcaa 


catcaacaag 


ttcctgaacc 


agtgctctgt 


aagctatctg 


2520 


atgaactcca 


tgatcccgta 


cggtgttaaa 


cgtctggagg 


acttcgatgc 


gtctctgaaa 


2580 


gacgccctgc 


tgaaatacat 


ttacgacaac 


cgtggcactc 


tgatcggtca 


ggttgatcgt 


2640 


ctgaaggaca 


aagtgaacaa 


taccttatcg 


accgacatcc 


cttttcagct 


cagtaaatat 


2700 


gtcgataacc 


aacgcctttt 


gtccactcta 


gactag 






2736 


<210> 85 
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<211> 911 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 85 

Gly Ser Met Pro lie Thr lie Asn Asn Phe Arg Tyr Ser Asp Pro Val 
1 5 10 ~ ' 15 



Asn Asn Asp Thr He He Met Met Glu Pro Pro Tyr Cys Lys Gly Leu 
20 25 30 



Asp He Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Trp He Val 
35 40 45 



Pro Glu Arg Tyr Glu Phe Gly Thr Lys Pro Glu Asp Phe Asn Pro Pro 
50 55 60 



Ser Ser Leu He Glu Gly Ala Ser Glu Tyr Tyr Asp Pro Asn Tyr Leu 
65 70 75 ~ 80 



Arg Thr Asp Ser Asp Lys Asp Arg Phe Leu Gin Thr Met Val Lys Leu 
85 90 95 



Phe Asn Arg He Lys Asn Asn Val Ala Gly Glu Ala Leu Leu Asp Lys 
100 105 110 



He He Asn Ala He Pro Tyr Leu Gly Asn Ser Tyr Ser Leu Leu Asp 
115 120 125 



Lys Phe Asp Thr Asn Ser Asn Ser Val Ser Phe Asn Leu Leu Glu Gin 
130 135 140 



Asp Pro Ser Gly Ala Thr Thr Lys Ser Ala Met Leu Thr Asn Leu He 
145 150 155 160 



He Phe Gly Pro Gly Pro Val Leu Asn Lys Asn Glu Val Arg Gly He 
165 170 175 



Val. Leu Arg Val Asp Asn Lys Asn Tyr Phe Pro Cys Arg Asp Gly Phe 
180 185 190 



Gly Ser He Met Gin Met Ala Phe Cys Pro Glu Tyr Val Pro Thr Phe 
195 200 205 
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Asp Asn Val lie Glu Asn lie Thr Ser Leu Thr lie Gly Lys Ser Lys 
210 215 220 



Tyr Phe Gin Asp Pro Ala Leu Leu Leu Met His Glu Leu lie His Val 
225 230 235 240 



Leu His Gly Leu Tyr Gly Met Gin Val Ser Ser His Glu lie lie Pro 
245 250 255 



Ser Lys Gin Glu lie Tyr Met Gin His Thr Tyr Pro lie Ser Ala Glu 
260 265 270 



Glu Leu Phe Thr Phe Gly Gly Gin Asp Ala Asn Leu lie Ser lie Asp 
275 280 285 



lie Lys Asn Asp Leu Tyr Glu Lys Thr Leu Asn Asp Tyr Lys Ala lie 
290 295 300 



Ala Asn Lys Leu Ser Gin Val Thr Ser Cys Asn Asp Pro Asn lie Asp 
305 310 315 320 



lie Asp Ser Tyr Lys Gin lie Tyr Gin Gin Lys Tyr Gin Phe Asp Lys 
325 330 335 



Asp Ser Asn Gly Gin Tyr lie Val Asn Glu Asp Lys Phe Gin lie Leu 
340 345 350 



Tyr Asn Ser lie Met Tyr Gly Phe Thr Glu lie Glu Leu Gly Lys Lys 
355 360 365 



Phe Asn lie Lys Thr Arg Leu Ser Tyr Phe Ser Met Asn His Asp Pro 
370 375 380 



Val Lys lie Pro Asn Leu Leu Asp Asp Thr He Tyr Asn Asp Thr Glu 
385 390 395 400 



Gly Phe Asn He Glu Ser Lys Asp Leu Lys Ser Glu Tyr Lys Gly Gin 
405 410 415 



Asn Met Arg Val Asn Thr Asn Ala Phe Arg Asn Val Asp Gly Ser Gly 
420 425 430 



Leu Val Ser Lys Leu He Gly Leu Cys Val Asp Gly He He Thr Ser 
435 440 445 



Lys Thr Lys Ser Leu lie Glu Gly Arg Phe Gly Gly Phe Thr Gly Ala 
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450 455 460 



Arg Lys Ser Ala Arg Lys Arg Lys Asn Gin Ala Leu Ala Gly Gly Gly 
465 470 475 480 



Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu 
485 490 495 



Gin Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 
500 505 510 



Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu He Thr Ser Asp 
515 520 525 



Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu Asp Leu He Gin 
530 535 540 



Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn He Ser 
545 550 555 560 



He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu Glu Leu Met Pro 
565 570 575 



Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 
580 585 590 



Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu His Gly Lys Ser 
595 600 605 



Arg lie Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser 
610 615 620 



Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys 
625 ~ 630 635 * * 640 



Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr 
645 650 655 



Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys He Ala 
660 665 670 



Asp He Thr He He He Pro Tyr He Gly Pro Ala Leu Asn He Gly 
675 680 685 



Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie Phe Ser Gly 

- "'690-f- ■■a 695 v • " 1 700' 
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Ala Val lie Leu Leu Glu Phe lie Pro Glu lie Ala lie Pro Val Leu 
705 710 715 720 



Gly Thr Phe Ala Leu Val Ser Tyr lie Ala Asn Lys Val Leu Thr Val 
725 730 735 



Gin Thr lie Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu 
740 745 750 



Val Tyr Lys Tyr lie Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gin 
755 760 765 



lie Asp Leu lie Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gin Ala 
770 775 780 



Glu Ala Thr Lys Ala lie He Asn Tyr Gin Tyr Asn Gin Tyr Thr Glu 
785 790 795 800 



Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp Leu Ser Ser Lys 
805 810 815 



Leu Asn Glu Ser He Asn Lys Ala Met He Asn He Asn Lys Phe Leu 
820 825 830 



Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met He Pro Tyr Gly 
835 840 845 



Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu 
850 855 860 



Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu lie Gly Gin Val Asp Arg 
865 870 875 880 



Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp lie Pro Phe Gin 
885 890 895 



Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser Thr Leu Asp 
900 905 910 



<210> 86 

<211> 180 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223«> 'Synthetic ■ ■ r 
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<400> 86 
ggatccacgc 


acgtcgacgc gattgatggt 


cgttttggcg 


gtttcacggg 


cgcacgcaaa 


60 


tcagcgcgta 


aacgtaagaa 


ccaggcgcta 


gcgggcggtg 


gcggtagcgg 


cggtggcggt 


120 


agcggcggtg gcggtagcgc actagtgctg 


cagacgcacg 


gtctagaatg 


ataaaagctt 


180 


<210> 87 
<211> 2715 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Synthetic 












<400> 87 
ggatccgaat 


tcatgccgat 


caccatcaac 


aacttcaact 


acagcgatcc 


ggtggataac 


60 


aaaaacatcc 


tgtacctgga 


tacccatctg 


aataccctgg 


cgaacgaacc 


ggaaaaagcg 


120 


tttcgtatca 


ccggcaacat 


ttgggttatt 


ccggatcgtt 


ttagccgtaa 


cagcaacccg 


180 


aatctgaata 


aaccgccgcg 


tgttaccagc 


ccgaaaagcg 


gttattacga 


tccgaactat 


240 


ctgagcaccg 


atagcgataa 


agataccttc 


ctgaaagaaa 


tcatcaaact 


gttcaaacgc 


300 


atcaacagcc 


gtgaaattgg 


cgaagaactg 


atctatcgcc 


tgagcaccga 


tattccgttt 


360 


ccgggcaaca 


acaacacccc 


gatcaacacc 


tttgatttcg 


atgtggattt 


caacagcgtt 


420 


gatgttaaaa 


cccgccaggg 


taacaattgg 


gtgaaaaccg 


gcagcattaa 


cccgagcgtg 


480 


attattaccg 


gtccgcgcga 


aaacattatt 


gatccggaaa 


ccagcacctt 


taaactgacc 


540 


aacaacacct 


ttgcggcgca 


ggaaggtttt 


ggcgcgctga 


gcattattag 


cattagcccg 


600 


cgctttatgc 


tgacctatag 


caacgcgacc 


aacgatgttg 


gtgaaggccg 


tttcagcaaa 


660 


agcgaatttt 


gcatggaccc 


gatcctgatc 


ctgatgcatg 


aactgaacca 


tgcgatgcat 


720 


aacctgtatg 


gcatcgcgat 


tccgaacgat 


cagaccatta 


gcagcgtgac 


cagcaacatc 


780 


ttttacagcc 


agtacaacgt 


gaaactggaa 


tatgcggaaa 


tctatgcgtt 


tggcggtccg 


840 


accattgatc 


tgattccgaa 


aagcgcgcgc 


aaatacttcg 


aagaaaaagc 


gctggattac 


900 


tatcgcagca 


ttgcgaaacg 


tctgaacagc 


attaccaccg 


cgaatccgag 


cagcttcaac 


960 


aaatatatcg 


gcgaatataa 


acagaaactg 


atccgcaaat 


atcgctttgt 


ggtggaaagc 


1020 


agcggcgaag 


ttaccgttaa 


ccgcaataaa 


ttcgtggaac 


tgtacaacga 


actgacccag 


1080 


atcttcaccg 


aatttaacta 


tgcgaaaatc 


tataacgtgc 


agaaccgtaa 


aatctacctg 


1140 


agcaacgtgt 


ataccccggt 


gaccgcgaat 


attctggatg 


ataacgtgta 


cgatatccag 


1200 


aacggcttta 


acatcccgaa 


aagcaacctg 


aacgttctgt 


ttatgggcca 


gaacctgagc 


1260 


- ^cgtaatccgg 


cgctgcgtaa 


agtgaacccg 


gaaaacatgc 


tgtacctgtt 


caccaaattt 


1320 
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tgcgtcgacg 


cgattgatgg 


tcgttttggc 


ggtttcacgg gcgcacgcaa 


atcagcgcgt 


1380 


aaacgtaaga 


accaggcgct 


agcgggcggt 


ggcggtagcg gcggtggcgg 


tagcggcggt 


1440 


ggcggtagcg 


cactagtgct 


gcagtgtcgt 


gaactgctgg tgaaaaacac 


cgatctgccg 


1500 


tttattggcg 


atatcagcga 


tgtgaaaacc 


gatatcttcc tgcgcaaaga 


tatcaacgaa 


1560 


gaaaccgaag 


tgatctacta 


cccggataac 


gtgagcgttg atcaggtgat 


cctgagcaaa 


1620 


aacaccagcg 


aacatggtca 


gctggatctg 


ctgtatccga gcattgatag cgaaagcgaa 


1680 


attctgccgg 


gcgaaaacca 


ggtgttttac 


gataaccgta cccagaacgt 


ggattacctg 


1740 


aacagctatt 


actacctgga 


aagccagaaa 


ctgagcgata acgtggaaga 


ttttaccttt 


1800 


acccgcagca 


ttgaagaagc 


gctggataac 


agcgcgaaag tttacaccta ttttccgacc 


1860 


ctggcgaaca 


aagttaatgc 


gggtgttcag 


ggcggtctgt ttctgatgtg ggcgaacgat 


1920 


gtggtggaag 


atttcaccac 


caacatcctg 


cgtaaagata ccctggataa 


aatcagcgat 


1980 


gttagcgcga 


ttattccgta 


tattggtccg 


gcgctgaaca ttagcaatag 


cgtgcgtcgt 


2040 


ggcaatttta 


ccgaagcgtt 


tgcggttacc 


ggtgtgacca ttctgctgga agcgtttccg 


2100 


gaatttacca 


ttccggcgct 


gggtgcgttt 


gtgatctata gcaaagtgca ggaacgcaac 


2160 


gaaatcatca 


aaaccatcga 


taactgcctg 


gaacagcgta ttaaacgctg 


gaaagatagc 


2220 


ta,tgaatgga 


tgatgggcac 


ctggctgagc 


cgtattatca cccagttcaa 


caacatcagc 


2280 


taccagatgt 


acgatagcct 


gaactatcag 


gcgggtgcga ttaaagcgaa 


aatcgatctg 


2340 


gaatacaaaa 


aatacagcgg 


cagcgataaa 


gaaaacatca aaagccaggt 


tgaaaacctg 


2400 


aaaaacagcc 


tggatgtgaa 


aattagcgaa 


gcgatgaata acatcaacaa 


attcatccgc 


2460 


gaatgcagcg 


tgacctacct 


gttcaaaaac 


atgctgccga aagtgatcga 


tgaactgaac 


2520 


gaatttgatc 


gcaacaccaa 


agcgaaactg 


atcaacctga tcgatagcca 


caacattatt 


2580 


ctggtgggcg 


aagtggataa 


actgaaagcg 


aaagttaaca acagcttcca 


gaacaccatc 


2640 


c'cgtttaaca 


tcttcagcta 


taccaacaac 


agcctgctga aagatatcat 


caacgaatac 


2700 


ttcaatctag 


actag 








2715 



<210> 88 

<211> 904 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 88 



Gly Ser Glu Phe Met Pro- He Thr lie Asra-Asn Phe Asn Tyr; Ser Asp 
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10 15 



Pro Val Asp Asn Lys Asn lie Leu Tyr Leu Asp Thr His Leu Asn Thr 
20 25 30 



Leu Ala Asn Glu Pro Glu Lys Ala Phe Arg lie Thr Gly Asn He Trp 
35 40 45 



Val He Pro Asp Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 
50 55 60 



Pro Pro Arg Val Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr 
65 70 75 80 



Leu Ser Thr Asp Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys 
85 90 95 



Leu Phe Lys Arg He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr 
100 105 110 



Arg Leu Ser Thr Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He 
115 120 125 



Asn Thr Phe Asp Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr 
130 135 140 



Arg Gin Gly Asn Asn Trp Val Lys Thr Gly Ser He Ash Pro Ser Val 
145 150 155 160 



He He Thr Gly Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr 
165 170 175 



Phe Lys Leu Thr Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala 
180 185 190 



Leu Ser He He Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn 
195 200 205 



Ala Thr Asn Asp Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys 
210 215 220 



Met Asp Pro He Leu He Leu Met His Glu Leu Asn His Ala Met His 
225 230 235 240 



Asn Leu Tyr Gly lie Ala He Pro Asn Asp Gin Thr He Ser Ser Val 

* ; 245^ -r 255 
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Thr Ser Asn lie Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala 
260 265 270 



Glu He Tyr Ala Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser 
275 280 285 



Ala Arg Lys Tyr Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He 
290 295 300 



Ala Lys Arg Leu Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn 
305 310 315 320 



Lys Tyr He Gly Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe 
325 330 335 



Val Val Glu Ser Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val 
340 345 350 



Glu Leu Tyr Asn Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala 
355 360 365 



Lys He Tyr Asn Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr 
370 375 380 



Thr Pro Val Thr Ala Asn He Leu Asp Asp Asn Val Tyr Asp lie Gin 
385 390 395 400 



Asn Gly Phe Asn He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly 
405 410 415 



Gin Asn Leu Ser Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn 
420 425 430 



Met Leu Tyr Leu- Phe Thr Lys Phe Cys Val Asp Ala He Asp Gly Arg 
435 440 445 



Phe Gly Gly Phe Thr Gly Ala Arg Lys Ser Ala Arg Lys Arg Lys Asn 
450 455 460 



Gin Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
465 470 475 480 



Gly Gly Ser Ala Leu Val Leu Gin Cys Arg Glu Leu Leu Val Lys Asn 
485 . . 490 495 
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Thr Asp Leu Pro Phe He Gly Asp He Ser Asp Val Lys Thr Asp He 
500 505 510 



Phe Leu Arg Lys Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro 
515 520 525 



Asp Asn Val Ser Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu 
530 535 540 



His Gly Gin Leu Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu 
545 550 555 560 



He Leu Pro Gly Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn 
565 570 575 



Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser 
580 585 590 



Asp Asn Val Glu Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu 
595 600 605 



Asp Asn Ser Ala Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys 
610 615 620 



Val Asn Ala Gly Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp 
625 630 635 640 



Val Val Glu Asp Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp 
645 650 655 



Lys He Ser Asp Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu 
660 665 670 



Asn He Ser Asn Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala 
675 680 685 



Val Thr Gly Val Thr He Leu Leu Glu Ala Phe Pro Glu . Phe Thr He 
690 695 700 



Pro Ala Leu Gly Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn 
705 710 715 720 



Glu He He Lys Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg 
725 730 735 
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Trp Lys Asp Ser Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He 
740 745 750 



He Thr Gin Phe Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn 
755 760 765 



Tyr Gin Ala Gly Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys 
770 775 780 



Tyr Ser Gly Ser Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu 
785 790 795 800 



Lys Asn Ser Leu Asp Val Lys He Ser Glu Ala Met Asn Asn He Asn 
805 810 815 



Lys Phe He Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu 
820 825 830 



Pro Lys Val He Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala 
835 840 845 



Lys Leu He Asn Leu He Asp Ser His Asn He He Leu Val Gly Glu 
850 855 860 



Val Asp Lys Leu Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr He 
865 * " 870 875 880 



Pro Phe Asn lie Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp He 
885 890 895 



He Asn Glu Tyr Phe Asn Leu Asp 
900 
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